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The present invention includes recombinant proteins derived from Clostridium botulinum toxins. In particular, soluble recombinant 
Clostridium botulinuni type A t type B and type E toxin proteins are provided. Methods which allow for the isolation of recombinant proteins 
free of significant endotoxin contamination are provided. The soluble, endotox in-free recombinant proteins are used as immunogens for 
the production of vaccines and antitoxins. These vaccines and antitoxins are useful in the treatment of humans and other animals at risk of 
intoxication with clostridial toxin. 
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MULTIVALENT VACCINE FOR CLOSTRIDIUM BOTULINUM NEUROTOXIN 

FIELD OF THE INVENTION 

The present invention relates to the isolation of polypeptides derived from Clostridium 
hotulinum neurotoxins and the use thereof as immunogens for the production of vaccines, 
including multivalent vaccines, and antitoxins. 

BACKGROUND OF THE INVENTION 

The genus Clostridium is comprised of gram-positive, anaerobic, spore-forming bacilli. 
The natural habitat of these organisms is the environment and the intestinal tracts of humans 
and other animals. Indeed. Clostridia are ubiquitous: they are commonly found in soil. dust, 
sewage, marine sediments, decaying vegetation, and mud. f.SVc e.g., P.H.A. Sneath et at.. 
"Clostridium " Betsey's Manual '» of Systematic Bacteriology. Vol. 2. pp, 1141-1200. 
Williams & Wilkins (1986).) Despite the identification of approximately 100 species of 
Clostridium, only a small number have been recognized as etiologic agents of medical and 
veterinary importance. Nonetheless, these species are associated with very serious diseases, 
including botulism, tetanus, anaerobic cellulitis, gas gangrene, bacteremia, pseudomembranous 
colitis, and clostridial gastroenteritis. Table 1 lists some of the species of medical and 
veterinary importance and the diseases with which they are associated. As virtually all of 
these species have been isolated from fecal samples of apparently healthy persons, some of 
these isolates may be transient, rather than permanent residents of the colonic ilora. 



TABLE I 

Clostridium Species 01* Medical And Veterinary Importance* 



Species 


Disease 


('. itmtnont/cricum 


Bacteriuria (pregnant women) 


C \ urgent incuse 


Infected wounds: Bacteremia: Botulism: Infections of amniotic fluid 


C. haratii 


Infected war wounds: Peritonitis: Infectious processes of the eye. ear 
and prostate 


C '. heijerinckikii 


Infected wounds 


C. hifcrnwnutm 


Infected wounds: Abscesses: Gas Gangrene: Bacteremia 


(\ hntuiimtttt 


Food poisoning: Botulism (wound, food, infant) 


C. htttyricum 


Urinary tract, lower respiratory tract, pleural cavity, and abdominal 
infections: Infected wounds: Abscesses: Bacteremia 


C. viutaveris 


Abscesses: Infected wounds 
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TABLE ! 

Clostridium Species Of Medical And Veterinary Importance* 





Species 


Disease 




(.', Lornis 


Soft tissue infections: Bacteremia 




C. cftunvtmi 


Blackleg 




( '. vlostndiofornw 


Abdominal, cervical, scrotal, pleural, and other infections; Septicemia 
Peritonitis; Appendicitis 




f ' / i 


Isolated from human disease processes, but role in disease unknown 


S 


( *. difficile 


Antimicrobial-associated diarrhea: Pseudomembranous enterocolitis: 
Bacteremia: Pyogenic infections 




(' fuliax 


Soft tissue infections 




C. jzhnoii 


Soft tissue infections 




( '. glycol uum 


Wound infections: Abscesses; Peritonitis 




I '. 1 nt.su forme 


Infected war wounds" Bacteremia* Ah^rpvLci^ 


10 


(' hisiolytitrum 


Infected war wounds* (ins i»;inirr(»m»- * im«ii%v<»i n i-i, 

""iiiiu.i, um i hi. , ^Miiuivdi ptaciuc isolate 




( \ ithiuhs 


Gastrointestinal tract infections 




( ' innocuum 


Gastrointestinal tract infections: Empyema 




< ' trvc£ulurv 


Penile lesions 




( . U fHum 


Isolated from human disease processes, but role in disease unknown 


15 


( ' Hniosum 


Bacteremia: Peritonitis: Pulmonary infections 




( muhnom outturn 


Various infectious processes 




( '. mnyt 


Infected wounds; Gas gangrene: Blackleg. Big head (ovine): Kedwater 
disease (bovine) 




( . ( tt't // tt'H/fi 


L'rinary tract infections: Rectal abscesses 




('■ iHtraputrifiatm 


Bacteremia: Peritonitis: Infected wounds; Appendicitis 


20 


( ' per/ringem 


Gas gangrene: Anaerobic cellulitis: Intra-abdominal abscesses: Soft 
tissue infections: Pood poisoning: Necrotizing pneumonia: Lmpvcma: 
Meningitis; Bacteremia: Uterine Infections: Enteritis uecrotans: Lamb 
dysentery: Struck: Ovine Emcrotoxemia: 




( punvfuciens 


Bacteriuria (Pregnant women with bacteremia) 




C. fnttnfiaim 


Abscesses: Infected wounds; Bacteremia 




('. rummttM 


Infections of the abdominal cavity, genital tract, lung, und biliary tract; 
Bacteremia 




( sunu^oformc 


Isolated from human disease processes, but role in disease unknown 


25 


C. scfUicum 


Gas gangrene: Bacteremia: Suppurative injections; Necroti/ini: 
enterocolitis: Braxy 




C. sordcllii 


Gas gnngrene; Wound infections: Penile lesions: Bacteremia; 
Abscesses: Abdominal and vaginal infections 



- ? - 
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TABLE ! 



Species 


Disease 


( \ sfihenotdes 


Appendicitis; Bacteremia: Bone and soft tissue infections: 
Intraperitoneal infections: Infected war wounds: Visceral iias uangrene: 
Renal abscesses ~ * 


( '. spttrtigaivs 


Gas gangrene: Bacteremia: Endocarditis: central nervous svstcm and 
pleuropulmonary infections: Penile lesions: Infected war wounds: 
Other pyogenic infections 


C \ snht emanate 


Bacteremia: Empyema: Biliary tract, soft tissue and bone infections 


( '. symhiosum 


Liver abscesses: Bacteremia: Infections resulting due to howcl flora 


( \ fi f tlltOt 


Gas gangrene: Appendicitis: Brain abscesses: Intestinal tract and soft 
tissue infections: Infected war wounds: Periodontics: Bacteremia 


( '. let unt 


Tetanus: Infected gums and tceih: Corneal ulcerations: Mastoid and 
middle ear infections: Intraperiloneal infections: Tetanus neonatorum: 
Postpartum uterine infections: Soft tissue infections, especially related 
to trauma (including abrasions and lacerations >: Infections related to 
use of contaminated needles 


( (hcrwuMtcchun }f\ t lentil 


Isolated from human disease processes, bin role in disease unknown 



15 



C ompiled from V.C, Lngelkirk c, <// TlawficaU,,,,". Principle* and Pmcuce ,rf Clinical Anacrohc 
liacerwla^. pp. 22-23. Slar Publishing Co.. liclmonl. CA (1992): .1. Stephen and K A I'etrowski 
Inxms Much Traverse Memhrancs and Dcregulule Cells." in Ikaicrial Taxim \\ ed np 66-67' 
American Soe.e.y tor Microbiology ( 1986): II. Berkow and A.J. Hetchcr <eds.). "few,^,, 

wTn^T 'I'?!"*"'"" 16,11 Cd - PP ' l6 -' 2f> - Mcrck Research l-atorniories. Rah wav 

VJ. ( IW2): and O il. S.gmtind and CM. Fraser (eds.). "Cln.uridn.l In/eaions." Mcrck Vccrman ' 
Manual. .Mh ed.. pp. 396-109. Mcrck & Co.. Rahway. N.J. (1979). 



In mosi cases, the pathogen ieiiy of ihesc onanisms is related to the release of powerful 
exotoxins or highly destructive enzymes. Indeed, several species of the genus ( 'h,.mulium 
produce toxins and other enzymes of great medical and veterinary significance. |C.L. 
Ilatheway. Clin. Microbiol. Rev. 3:66-98 ( 1 990). | 

Perhaps because of their significance lor human and veterinary medicine, much 
research has been conducted on these toxins, in particular those of C hondimnn and ('. 
tlifficih: 



C. botutimim 

Several strains of C .'hxiriiHum hotulimm produce toxins of significance to human and 
animal health. |C.L. I latheway. Clin. Microbiol. Rev. 3:66-98 (!990)| The effects of these 
toxins range from diarrheal diseases that can cause destruction of the colon, to paralytic 
effects that can cause death. Particularly at risk for developing clostridial diseases are 
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neonates and humans and animals in poor health (e.g.. those suffering from diseases 
associated with old age or immunodeficiency diseases). 

Clostridium hotulinum produces the most poisonous biological toxin known. The 
lethal human dose is a mere 10"' mg/kg bodyweight for toxin in the bloodstream. Botulinal 
5 toxin blocks nerve transmission to the muscles, resulting in flaccid paralysis. When the toxin 
reaches airway and respiratory muscles, it results in respiratory failure that can cause death. 
|S. Anion. J. Infect. Dis. 154:201-206 (I986)| 

C hotulinum spores are carried by dust and are found on vegetables taken from the 
soil, on fresh fruits, and on agricultural products such as honey. Under conditions favorable 

10 to the organism, the spores germinate to vegetative cells which produces toxin. [S. Anion. 
Ann. Rev. Med. 31:541 (1980)) 

Botulism disease may be grouped into four types, based on the method of introduction 
ot toxin into the bloodstream. Food-borne botulism results from ingesting improperly 
preserved and inadequately heated food that contains botulinal toxin. There were 355 cases of 

15 food-borne botulism in the United States between 1976 and 1984. |K.L. MacDonald et uL. 

Am. J. Kpidemiol. 124:794 (1986).) The death rate due to botulinal toxin is 12% and can be 
higher in particular risk groups. [CO. Tacket et <//.. Am. J. Med. 76:794 ( I984).| Wound- 
induced botulism results from C hotulinum penetrating traumatized tissue and producing toxin 
thai is absorbed into the bloodstream. Since 1950, thirty cases of wound botulism have been 

20 reported. | M.N. Swart/. "Anaerobic Spore-Forming Bacilli The ( lostridicC pp. 633-646. in 
I3.D. Davis et <//..(eds.). Microbiology. 4th edition. .1.13. I.ippincott Co. (1990).] Inhalation 
botulism results when the toxin is inhaled. Inhalation botulism has been reported as the result 
of accidental exposure in the laboratory |P:. Holzer. Med. Klin. 41:1735 (1962)) and could 
arise if the toxin is used as an agent of biological warfare |I).R. I ran/ ct aL. in Botulinum 

25 and Tetanus Xeurotoxins. 13. R. DasGupta. ed.. Plenum Press. New York (1993). pp. 473-476]. 
Inleclious infant botulism results from ( '. hotulinum colonization of the infant intestine with 
production of toxin and its absorption into the bloodstream. It is likely that the bacterium 
gams entry when spores are ingested and subsequently germinate. |S. Arnon. J. Infect. Dis. 
154:201 (1 986). | There have been 500 eases reported since it was first recognized in 1976. 

30 | M.N. Swarlz. supra.] 
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Infant botulism strikes infants who are three weeks to eleven months old (greater than 
90% of the cases are infants less than six months). |S. Arnon. J. Infect. Dis. 154:201 (I986).| 
It is believed that infants are susceptible, due. in large part, to the absence of the full adult 
complement of intestinal microflora. The benign microflora present in the adult intestine 
provide an acidic environment that is not favorable to colonization by C hoiulinum. Infants 
begin life with a sterile intestine which is gradually colonized by microflora. Because of the 
limited microflora present in early infancy, the intestinal environment is not as acidic, 
allowing for C hnmlhmm spore germination, growth, and toxin production. In this regard, 
some adults who have undergone antibiotic therapy which alters intestinal microflora become 
more susceptible to botulism. 

An additional factor accounting for infant susceptibility to infectious botulism is the 
immaturity of the infant immune system. The mature immune system is sensitized to 
bacterial antigens and produces protective antibodies. Secretory IgA produced in the adult 
intestine has the ability to agglutinate vegetative cells of C homlimtm. |S. Arnon. J. Infect. 
Dis. 154:201 <I986).| Secretory IgA may also act by preventing intestinal bacteria and their 
products from crossing the cells of the intestine. [S. Arnon. Epidemiol. Rev. 3:45 (1981 ).| 
I he infant immune system is not primed to do this. 

Clinical symptoms of infant botulism range from mild paralysis, to moderate and 
severe paralysis requiring hospitalization, to fulminant paralysis, leading to sudden death. [S. 
Arnon. Epidemiol. Rev. 3:45 (I "8 1).] 

The chief therapy for severe infant botulism is ventilatory assistance using a 
mechanical respirator and concurrent elimination of toxin and bacteria using cathartics, 
enemas, and gastric lavage. There were 68 hospitalizations in California for infant botulism 
in a single year with a total cost of over $4 million for treatment. [T.L. I-rankovich and S. 
Arnon. West. .1. Med. 154:103 (I99|).| 

Different strains of Clostridium hoiulinum each produce antigenically distinct toxin 
designated by the letters A-G. Serotype A toxin has been implicated in 26% of the cases of 
lood botulism: types B. E and F have also been implicated in a smaller percentage of the food 
botulism cases |II. Sugiyama. Microbiol. Rev. 44:419 (1980)]. Wound botulism has been 
reportedly caused by only types A or B toxins fH. Sugiyama. supm\. Nearly all cases of 
infant botulism have been caused by bacteria producing either type A or type i) toxin. 
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(Exceptionally, one New Mexico case was caused by Clostridium hotittinum producing type F 
toxin and another by Clostridium hotidimtm producing a type U-type F hybrid.) fS. Arnon. 
Epidemiol. Rev. 3:45 (1981).) Type C toxin affects waterfowl, cattle, horses and mink. Type 
I) toxin affects cattle, and type E toxin affects both humans and birds. 
* s A trivalent antitoxin derived from horse plasma is commercially available from 

Connaught Industries Ltd. as a therapy for toxin types A. B. and I*. However, the antitoxin 
has several disadvantages. First, extremely large dosages must be injected intravenousk 
and/or intramuscularly. Second, the antitoxin has serious side effects such as acute 
anaphylaxis which can lead to death, and serum sickness. Finally, the efficacy of the 
10 antitoxin is uncertain and the treatment is costly. |C.(). Tacket ct <//.. Am. J. Med. 76:7 ( )4 
(1 984). | 

A hcpiavalent equine botulinal antitoxin which uses only the F(ab')2 portion of the 
antibody molecule has been tested by the United States Military. |M. Baladv. USAMRDC 
Newsletter, p. 6 ( 1 991 ).| This was raised against impure toxoids in those large animals and is 
15 not a high titer preparation, 

A pentavalent human antitoxin has been collected from immunized human subjects for 
use as a treatment for infant botulism. The supply of this antitoxin is limited and cannot be 
expected to meet the needs of all individuals stricken with botulism disease. In addition, 
collection of human sera must involve screening out I II V and other potentially serious human 

20 pathogens. Sehwarz and S.S. Anion, Western J. Med. 156:197 (1992).) 

Infant botulism has been implicated as the cause of mortality in some cases of Sudden 
Infant Death Syndrome (S1DS. also known as crib death). SIDS is officially recognized as 
infant death that is sudden and unexpected and that remained unexplained despite complete 
post-mortem examination. The link of SIDS to infant botulism came when fecal or blood 

21 specimens taken at autopsy from SIDS infants were found to contain (*. botulimmi organisms 
and/or toxin in 3-4% of cases analyzed. [D.R. Peterson ct Rev. Infect. Dis. 1:630 
(1979). | In contrast, only I of 160 healthy infants (0.6%) had C hotttlimtm organisms in the 
feces and no botulinal toxin. (S. Arnon et «/.. Lancet, pp. 1273-76. June 17. 1978.) 

In developed countries. SIDS is the number one cause of death in children between 
30 one month and one year old. (S. Anion ct <//.. Lancet, pp. 1273-77. June 17. 1978.) More 
children die from SIDS in the first year than from any other single cause of death in the first 
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fourteen years of life. In the United States, there are 8.000-10,000 SIDS victims annually. 
Id 

What is needed is an effective therapy against infant botulism that is free of dangerous 
side effects, is available in large supply at a reasonable price, and can be safely and gently 
delivered so that prophylactic application to infants is feasible. 

Immunization of subjects with toxin preparations has been done in an attempt to 
induce immunity against botulinal toxins. A C. batulinum vaccine comprising chemicallv 
inactivated {i.e.. formaldehyde-treated) type A. B. C. D and F toxin is commercially available 
lor human usage. However, this vaccine preparation has several disadvantages. First, the 
eltlcacy of this vaccine is variable (in particular, only 78% of recipients produce protective 
levels of anti-type 13 antibodies following administration of the primary series). Second, 
immunization is painful (deep subcutaneous inoculation is required for administration), with 
adverse reactions being common (moderate to severe local reactions occur in approximately 
<>% ot recipients upon initial injection: this number rises to approximately 1 1% of individuals 
who receive booster injections) [Informational Brochure for the Pentavalent ( ABCDF) 
Botulinum Toxoid. Centers for Disease C ontrol]. Third, preparation of the vaccine is 
dangerous as active toxin must be handled by laboratory workers. 

V\hat is needed are safe and effective vaccine preparations for administration to those 
at risk of exposure to ( ' hoiu/inum toxins. 



C. difficile 

( difficile, an organism which gained its name due to difficulties encountered in its 
isolation, has recently been proven to be an etiologic agent of diarrheal disease. (Sneath et 
ai. p. I 165.). difficile is present in the gastrointestinal tract of approximately 3% of 
healthy adults, and 10-30% of neonates without adverse effect (Swartz. at p. 644): by other 
estimates. C. difficile is a part of the normal gastrointestinal flora of 2-10% of humans. (G.F 
Brooks et <//.. (eds.) "Infections Caused hy Anaerobic Bacteria" Jauvrz. Melnick d; 
Adelberg's Medical Microbiology. 1 9th ed.. pp. 257-262. Appleton & Lange. San Mateo. CA 
( IWI >.| As these organisms are relatively resistant to most commonly used antimicrobials, 
when a patient is treated with antibiotics, the other members of the normal gastrointestinal 
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flora are suppressed and C. difficile flourishes, producing cytopathic toxins and enterotoxins. 
It has been found in 25% of cases of moderate diarrhea resulting from treatment with 
antibiotics, especially the cephalosporins, clindamycin, and ampicillin. [M.N. Swartz at 644.1 

Importantly. C. difficile is commonly associated with nosocomial infections. The 
organism is often present in the hospital and nursing home environments and may be carried 
on the hands and clothing of hospital personnel who care for debilitated and 
immunocompromised patients. As many of these patients are being treated w ith 
antimicrobials or other chemotherapeutic agents, such transmission of C. difficile represents a 
significant risk factor for disease. (Engelkirk ei «/.. pp. 64-67.) 

C difficile is associated with a range of diarrhetic illness, ranging from diarrhea alone 
to marked diarrhea and necrosis of the gastrointestinal mucosa with the accumulation of 
inflammatory cells and fibrin, which forms a pseudomembranc in the affected area. (Brooks 
ci <//.) It has been found in over 95% of pseudomembranous enterocolitis cases. (Swartz. at 
p. 644.) This occasionally fatal disease is characterized by diarrhea, multiple small colonic 
plaques, and toxic megacolon. (Swartz. at p. 644.) Although stool cultures are sometimes 
used for diagnosis, diagnosis is best made by detection of the heat labile toxins present in 
fecal f.ltrates from patients with enterocolitis due to ( difficile. (Swartz. at p. 644-645: and 
Brooks ci ai. at p. 260.) C difficile toxins are cytotoxic for tissue/cell cultures and cause 
enterocolitis when injected intrucecally into hamsters. (Swartz. at p. 644.) 

The enterotoxicity of ( '. difficile is primarily due to the action of two toxins, 
designated A and B. each of approximately 300.000 in molecular weight. Both are potent 
cytotoxins. with toxin A possessing direct cnterocytotoxic activity. |l.ycrly ei al.. Infect. 
Immun. 60:4633 (1992).| Unlike toxin A of C per, 'rumens, an organism rarely associated 
with antimicrobial-associated diarrhea, the toxin ofC difficile is not a spore coat constituent 
and is not produced during sporulalion. (Swartz. at p. 644.) C difficile toxin A causes 
hemorrhage, fluid accumulation and mucosal damage in rabbit ileal loops and appears to 
increase the uptake of toxin B by the intestinal mucosa. Toxin B does not cause intestinal 
fluid accumulation, but it is 1000 times more toxic than toxin A to tissue culture cells and 
causes membrane damage. Although both toxins induce similar cellular effect s such as actin 
disaggregation, differences in cell specificity occurs. 
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Both toxins are important in disease. [Borriello ei ai. Rev. Infect. Dis.. 12(suppl. 
2):S185 (1990): Lyerly ei a/.. Infect. Immun.. 47:349 (1985): and Rolfe. Infect. Immun.. 
.">9: 1223 (1 990). | Toxin A is thought to act first by binding to brush border receptors, 
destroying the outer mucosal layer, then allowing toxin B to gain access to the underlying 
tissue. These steps in pathogenesis would indicate that the production of neutralizing 
antibodies against toxin A may be suftlcient in the prophylactic therapy of CDAD. However, 
antibodies against toxin B may be a necessary additional component for an effective 
therapeutic against later stage colonic disease. Indeed, it has been reported that animals 
require antibodies to both toxin A and toxin B to be completely protected against the disease. 
|kim and Rolfe. Abstr. Ann. Meet. Am. Soc. Microbiol.. 69:62 (1987).] 

C difficile has also been reported to produce other toxins such as an enterotoxin 
different from toxins A and B |Banno ei of. Rev. Infect. Dis.. 6(SuppI. I SI I-S20 ( I984)|. a 
low molecular weight toxin | Rihn el a/.. Biochein. Uiophys. Res. Comm.. 124:690-695 
1 1 984 )|. a motility altering factor [Justus ei al.. Gastroenterol.. 83:836-843 ( I982)|. and 
perhaps other toxins. Regardless. C. difficile gastrointestinal disease is of primary concern. 

It is significant that due to its resistance to most commonly used antimicrobials. ('. 
difficile is associated with antimicrobial therapy with virtually all antimicrobial agents 
(although most commonly ampicillin. clindamycin and cephalosporins). It is also associated 
with disease in patients undergoing chemotherapy with such compounds as methotrexate. 5- 
lluorouracil. cyclophosphamide, and doxorubicin. |S.M. FinegokUv al.. Clinical Guide „, 
Amierohic Infections, pp. 88-89. Star Publishing Co.. Belmont. CA (I992).| 

I reatmeni of C. difficile disease is problematic, given the high resistance of the 
organism. Oral metronidazole, bacitracin and vancomycin have been reported to be effective. 
(I'inegold et ai. p. 89.) However there are problems associated with treatment utilizing these 
compounds. Vancomycin is very expensive, some patients are unable to take oral medication, 
and the relapse rate is high (20-25%). although it may not occur for several weeks. Id. 

( '. difficile disease would be prevented or treated by neutralizing the effects of these 
toxins in the gastrointestinal tract. Thus, what is needed is an effective therapy against C. 
difficile toxin that is free of dangerous side effects, is available in large supply at a reasonable 
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price, and can be safely delivered so thai prophylactic application to patients at risk of 
developing pseudomembranous enterocolitis can be effectively treated. 

DESCRIPTION OF THE DRAWINGS 

figure I shows the reactivity of anli-C hotitlimtm IgY by Western blot. 
Figure 2 shows the IgY antibody liter to ('. hotuiinum type A toxoid in eggs, measured 
by m.JSA. 

figure :> shows the results of ('. difficile toxin A neutralization assays. 

Figure 4 shows the results of ('. difficile loxin B neutralization assays. 

f igure 5 shows the results of C. difficile loxin B neutralization assays. 

figure ft is a restriction map off. difficile toxin A gene, showing sequences of 
primers 1-4 (SFQ ID NOS:l-4). 

f igure 7 is a Western blot of C. difficile toxin A reactive protein. 

figure 8 shows ( difficile loxin A expression constructs. 

figure l > shows (' difficile loxin A expression constructs. 

Figure 10 shows the purification of recombinant ( '. difficile loxin A. 

Figure 1 I shows the results of ( '. difficile loxin A neutralization assays with antibodies 
reactive lo recombinant toxin A. 

Figure 12 shows the results for a C. difficile toxin A neutralization plate. 

Figure 13 shows the results for a C\ difficile toxin A neutralization plate. 

Figure 14 shows ihe results of recombinant (' difficile toxin A neutralization assays. 

f igure 15 shows ( '. difficile loxin A expression constructs. 

Figure I ft shows a chromatograph plotting absorbancc at 280 nm against retention time 
for a pMA 1X70-6X0 IgY PF(i preparation. 

Figure 17 shows two recombinant ( ' difficile toxin B expression constructs, 
figure 18 shows ( \ difficile loxin B expression constructs. 
Figure 19 shows C. difficile toxin B expression constructs. 
Figure 20 shows (.*. difficile toxin B expression constructs. 

figure 21 is an SDS-PACiF gel showing the purification of recombinant (* difficile 
loxin B fusion protein. 
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Figure 22 is an SDS-PAGE gel showing the purification of two histidine-tagged 
recombinant C. difficile toxin B proteins. 

Figure 23 shows (\ difficile loxin B expression constructs. 

Figure 24 is a Western blot of ('. difficile toxin B reactive protein. 

Figure 25 shows hotulinum type A toxin expression constructs; constructs used to 
provide ( \ hotulinum or C. difficile sequences are also shown. 

Figure 26 is an SDS-PAGE gel stained with Coomaisse blue showing the purification 
of recombinant (' hotulinum type A toxin fusion proteins. ( 

Figure 27 shows ('. hotulinum type A toxin expression constructs: constructs used to 
provide ( hotulinum sequences arc also shown. 

Figure 28 is an SDS-PAGE gel stained with Coomaisse blue showing the purification 
of pllisBot protein using the Ni-NTA resin. 

Figure 29 is an SDS-PAGE gel stained with Coomaisse blue showing the expression of 
pHisMot protein in B1.2KDE3) and BL21(DE3)pLysS host cells. 

Figure 30 is an SDS-PAGF gei stained with Coomaisse blue showing the purification 
of pllisBot protein using a batch absorption procedure. 

Figure 31 is an SDS-PAGE gel stained with Coomaisse blue showing the purification 
of pHisBoi and pi lisBot(native) proteins using a Ni-NTA column. 

Figure 32 is an SDS-PAGE gel stained with Coomaisse blue showing the purification 
of plIisBolA protein expressed in pllisBotA(syn) kan lack] T7/pACYCGro/BI.21(DE3) cells 
using an IDA column. 

Figure 33 is an SDS-PAGE gel stained with Coomaisse blue showing the purification 
of plIisBolA. pHisBotB and pifisBotE proteins by IDA chromatography followed by 
chromatography on S-100 to remove folding chapcrones. 

Figure 34 is an SDS-PAGE gel stained with Coomaisse blue showing the extracts 
derived from pHisBotB amp T7lac/BL2HDE3) cells before and after purification on a Ni- 
NTA column. 

hgure 35 is an SDS-PAGE gel run under native conditions and stained with 
Coomaisse blue showing the removal of folding chapcrones from IDA-purified BotB protein 
using a S-KK) column. 
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Figure 36 is an SDS-PAGE gel stained with Coomaisse blue showing proteins that 
eluted during an imidazole step gradient applied to a IDA column containing a lysate of 
pllisBotB kan laclq T7/pACYCGro/BL2l(DF3) cells. 

Figure 37 is an SDS-PAGE gel run under native conditions and stained with 
Coomaisse blue showing IDA-puriiled BolB protein before and alter ultrafiltration. 

Figure 38 is an SDS-PAGE gel stained with Coomaisse blue showing ,| lc purification 
of BolE protein using a NiNTA column. 

f igure 39 is an SOS-PAGE gel stained with Coomaisse blue showing extracts derived 
from pliisBotA kan T7 lac/BL21(DE3) pLysS cells grown in fermentation culture. 

figure 40 is a chromatogram showing proteins present after IDA-purified Both protein 
was applied to a S-100 column. 

DEFINITIONS 

To facilitate understanding of the invention, a number of terms are defined below. 

As used herein, the term "neutralizing" is used in reference to antitoxins, particularly 
antitoxins comprising antibodies, which have the ability to prevent the pathological actions of 
the toxin against which the antitoxin is directed. 

As used herein, the term "overproducing" is used in reference to the production of 
clostridial toxin polypeptides in a host cell and indicates that the host cell is producing more 
of the clostridial toxin by virtue of the introduction of nucleic acid sequences encoding said 
clostridial toxin polypeptide than would be expressed by said host cell absent the introduction 
of said nucleic acid sequences. To allow ease of purification of toxin polypeptides produced 
in a host cell it is preferred (hat the host cell express or overproduce said toxin polypeptide at 
a level greater than I mg/liter of host cell culture. 

"A host cell capable of expressing a recombinant protein at a level greater than or 
equal to 5% of the total cellular protein" is a host cell in which the recombinant protein 
represents at least 5% of the total cellular protein. To determine what percentage of total 
cellular protein the recombinant protein represents, the following steps are taken. A total of 
10 Ol>„„, units of recombinant host cells (e. K .. 200 M l of cells at OI),,,,,, 50/mh are removed 
(at a limepoint known to represent the peak of expression of the desired recombinant protein) 
to a 1.5 ml microlugc tube and pelleted for 2 min at maximum rpm in a microluge. The 
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pellets are resuspended in 1 ml of 50 mM NaIIP0 4 . 0.5 M NaCI. 40 mM imidazole buffer 
(pH 6.8) containing I mg/ml lysozyme. The samples are incubated for 20 min at room 
temperature and stored ON at -70°C. Samples are thawed completely at room temperature 
and sonicated 2X10 seconds with a Branson Sonifier 450 microtip probe at # 3 power 
setting. The samples are centrifuged for 5 min. at maximum rpm in a microluge. An aliquot 
(20 mD of the protein sample is removed to 20 ul 2X sample buffer (this represents the total 
protein extract). The samples are heated to 95°C for 5 min. then cooled and 5 or 10 ul are 
loaded onto 12.5% SDS-PAG0 gels. High molecular weight protein markers arc also loaded 
to allow for estimation of the MW of identified recombinant proteins. After electrophoresis, 
protein is detected generally by staining with Coomassie blue and the stained gel is scanned 
using a densitometer to determine the percentage of protein present in each band. In this 
manner, the percentage of protein present in the band corresponding to the recombinant 
protein of interest may be determined. It is not necessary that C oomassie blue be employed 
for the detection of protein, a number of fluorescent dyes |t<,ir. Sypro orange S-6651 
(Molecular I'robes. Eugene. OR| may be employed and the stained gel scanned using a 
II uoro imager \c.n.. Nuor Imager SI (Molecular Dynamics. Sunnyvale. CA)|. 

"A host cell capable of expressing a recombinant protein as a soluble protein at a level 
greater than or equal to 0.25% of the total soluble cellular protein" is a host cell in which the 
amount of soluble recombinant protein present represents at least 0.25% of the total cellular 
protein. As used herein "total soluble cellular protein" refers to a clarified PHI lysate 
prepared as described in Example 3l(c)(iv). Briefly, cells are harvested following induction 
of expression of recombinant protein (at a point of maximal expression). The cells are 
resuspended in cell ^suspension buffer (CRB: 50 mM NaPO,. 0.5 M NaCI. 40 mM 
imidazole, pi I 6.8) to create a 20% cell suspension (wet weight of cells/volume of CRB) and 
cell lysa.es are prepared as described in Example 3l(c)(iv, (/.,.. sonication or homogenization 
followed by ccntrifugaiion). The cell lysate is then flocculated utilizing polyethyleneimine 
(PHI) prior to ccntrifugaiion. PHI (a 2% solution in dH ; 0. pH 7.5 with IICI) is added to the 
cell lysate to a final concentration of 0.2%. and stirred for 20 min at room temperature prior 
to ccniri (ligation |8.5()0 rpm in JAM) rotor (Beekman) for 30 minutes at 4°C|. This treatment 
removes RNA. DNA and cell wall components, resulting in a clarified, low viscosity lysate 
(TI-.I clarified lysate"). The recombinant protein present in the PKI clarified lysate is then 



WO 98/08540 



PCT/US97/15394 



purified (e.jf.. by chromatography on an IDA column lor his-tagged proteins). The amount of 
purified recombinant protein [i.e.. the eluted protein) is divided by the concentration of 
protein present in the Pl/.I clarified lysatc (typically 8 mg/ml when using a 20% cell 
suspension as the starting material) and multiplied by 100 to determine what percentage of 
total soluble cellular protein is comprised of the soluble recombinant protein (see Example 



3 b). 



As used herein, the term "fusion protein" refers to a chimeric protein containing the 
protein of interest (i.e.. C. haiulinum toxin A. B. C. D. II. I-. or Ci and fragments thereof) 
joined to an exogenous protein fragment (the fusion partner which consists of a non-toxin 
protein). The fusion partner may enhance solubility of the C. htuulinum protein as expressed 
in a host cell, may provide an affinity tag to allow purification of the recombinant fusion 
protein from the host cell or culture supernatant, or both. If desired, the fusion protein may 
be remov ed from the proiein of interest [i.e.. toxin protein or fragments thereof) prior lo 
immunization by a variety of enzymatic or chemical means known to the ai t. 

As used herein the term "non-toxin protein" or "non-toxin protein sequence" refers lo 
thai portion of a fusion protein which comprises a protein or protein sequence which is not 
derived from a bacterial toxin protein. 

The term "protein of interest" as used herein refers lo the protein whose expression is 
desired within the fusion protein. In a fusion proiein the protein of interest will be joined or 
lused with another protein or protein domain, the fusion partner, lo allow for enhanced 
stability of the protein of interest and/or ease of purification of the fusion protein. 

As used herein, the term "maltose binding protein" refers to the maliose binding 
protein of A. co/i. A portion of the maltose binding protein may be added to a protein of 
interest to generate a fusion protein: a portion of the maltose binding protein may merely 
enhance the solubility of the resulting fusion protein when expressed in a bacterial host. On 
the other hand, a portion of the maltose binding protein may allow affinity purification of the 
fusion protein on an amylose resin. 

As used herein, the term "poly-histidine tract" when used in reference to a fusion 
protein refers to the presence of two to len histidine residues al either the amino- or carboxy- 
terminus of a protein of interest. A poly-histidine tract of six to ten residues is preferred. 
The poly-histidine tract is also defined functionally as being a number of consecutive histidine 
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residues added to the protein of interest which allows the affinity purification of the resultinu 
fusion protein on a nickel-chelate or IDA column. 

As used herein, the term "purified" or "to purify" refers to the removal of 
contaminants from a sample, for example, antitoxins are purified by removal of 
contaminating non-imnnmoglobulin proteins: they are also purified by the removal of 
immunoglobulin that does not bind toxin. The removal of non-immunoglobulin proteins 
and/or the removal of immunoglobulins that do not bind toxin results in an increase in the 
percent of toxin-reactive immunoglobulins in the sample. In another example, recombinant 
toxin polypeptides are expressed in bacterial host cells and the toxin polypeptides arc purified 
by the removal of host cell proteins: the percent of recombinant toxin polypeptides is thereby 
increased in the sample. Additionally, the recombinant toxin polypeptides are purified by the 
removal of host cell components such as lipopolysaccharidc (c.^.. endotoxin). 

The term "recombinant DNA molecule" as used herein refers to a DNA molecule 
which is comprised of segments of DNA joined together by means of molecular biological 
techniques. 

I he term "recombinant protein" or "recombinant polypeptide" as used herein refers to 
a protein molecule which is expressed from a recombinant DNA molecule. 

(he term "native protein" as used herein refers to a protein which is isolated from a 
natural source a.s opposed 10 the production of a protein by recombinant means. 

As used herein the term "portion" when in reference to a proiein (as in "a portion of a 
given protein") refers to fragments of that protein. The fragments may range in size from 
lour amino acid residues to the entire amino acid sequence minus one amino acid. 

As used herein "soluble" when in reference to a protein produced by recombinant 
DNA technology in a host cell is a protein which exists in solution in the cytoplasm of the 
host cell: if the protein contains a signal sequence the soluble protein is exported to the 
periplasmie space in bacteria hosts and is secreted into the culture medium in eucaryotic cells 
capable of secretion or by bacterial host possessing the appropriate genes (/.«.•.. the kit gene). 
In contrast, an insoluble protein is one which exists in denatured form inside cytoplasmic 
granules (called inclusion bodies) in the host cell. High level expression (i.e.. greater than 10- 
20 mg recombinant protein/liter of bacterial culture) of recombinant proteins often results in 
the expressed protein being found in inclusion bodies in the bacterial host cells. A soluble 
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protein is a protein which is not found in an inclusion body inside the host cell or is found 
both in the cytoplasm and in inclusion bodies and in this case the protein may be present at 
high or low levels in the cytoplasm. 

A distinction is drawn between a soluble protein [i.e.. a protein which when expressed 
in a host cell is produced in a soluble form) and a "solubilized" protein. An insoluble 
recombinant protein found inside an inclusion body may be solubilized (i.e.. rendered into a 
soluble form) by treating purified inclusion bodies with denaturants such as guanidine 
hydrochloride, urea or sodium dodecyl sulfate (SDS). These denaturants must then be 
removed from the solubilized protein preparation to allow the recovered protein to rcnature 
(refold). Not all proteins will refold into an active conformation after solubilization in a 
denaturant and removal of the denaturant. Many proteins precipitate upon removal of the 
denaturant. SDS may be used to solubilizc inclusion bodies and will maintain the proteins in 
solution at low concentration. However, dialysis will not always remove all of the SDS (SDS 
can form micelles which do not dialyze out): therefore. SDS-solubilizcd inclusion body 
protein is soluble but not refolded. 

A distinction is drawn between proteins which are soluble ( i.e.. dissolved) in a 
solution dev oid of significant amounts of ionic detergents [ex.. SDS) or denaturants te. K .. 
urea, guanidine hydrochloride) and proteins which exist as a suspension of insoluble protein 
molecules dispersed within the solution. A soluble protein will not be removed from a 
solution containing the protein by centrifugation using conditions sufficient to remove bacteria 
present in a liquid medium [i.e.. centrifugation al 12.000 x g for 4-5 minutes), for example, 
to lest whether two proteins, protein A and protein 13. are soluble in solution, the two proteins 
are placed into a solution selected from the group consisting of PBS-NaCI (PBS containing 
0.5 M NaCI). PBS-NaCI containing 0.2% Tween 20. PBS. I'BS containing 0.2% Tween 20. 
PBS-C (PBS containing 2 mM CaCI,). PBS-C containing either 0.1 or 0.5 % Tween 20. PBS- 
C containing either 0.1 or 0.5% NP-40. PBS-C containing either 0.1 or 0.5% Triton X-100. 
PBS-C containing 0.1% sodium deoxycholate. The mixture containing proteins A and B is 
then centrifuged at 5000 x g for 5 minutes. The supernatant and pellet formed by 
centrifugation are then assayed for the presence of protein A and U. If protein A is found in 
the supernatant and not in the pellet |exccpt for minor amounts [i.e.. less than 10%) as u 
result of trapping), protein is said to be soluble in the solution tested. If the majority of 
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protein B is found in the pellet (/.t\. greater than 90%). then protein B is said to exist as a 
suspension in the solution tested. 

As used herein, the term "therapeutic amount" refers to that amount of antitoxin 
required to neutralize the pathologic effects of one or more clostridial toxins in a subject. 

The term "pyrogen" as used herein refers to a fever-producing substance. Pyrogens 
may be endogenous to the host {e.g.. prostaglandins) or may be exogenous compounds 
bacterial endo- and exotoxins, nonbacterial compounds such as antigens and certain steroid 
compounds, etc.). The presence of pyrogen in a pharmaceutical solution may be detected 
using the U.S. Pharmacopeia (USP) rabbit fever tesi (United States Pharmacopeia. Vol. XXII 
(1900) United States Pharmacopeial Convention. Rockville. MD. p. 151). 

the term "endotoxin" as used herein refers to the high molecular weight complexes 
associated with the outer membrane of gram-negative bacteria. Unpurificd endotoxin contains 
lipids, proteins and carbohydrates. Highly purified endotoxin does not contain protein and is 
referred u> as lipopolysaccharide (LPS). Because unpuriiled endotoxin is of concern in the 
production of pharmaceutical compounds (e.g.. proteins produced in E. coli using recombinant 
DNA technology), the term endotoxin as used herein refers to unpurificd endotoxin. Bacterial 
endotoxin is a well known pyrogen. 

As used herein, the term "endotoxin-free" when used in reference to a composition to 
be administered parenteral!) (with the exception of intrathecal administration) to a host means 
that the dose to be delivered contains less than 5 F-U/kg body weight [FDA Guidelines for 
Parenteral Drugs (December l ( ^87)|. Assuming a weight of 70 kg for an adult human, the 
close must contain less than 350 HU to meet FDA Guidelines for parenteral administration. 
Endotoxin levels are measured herein using the Limulus Amebocyte Lysate {LAL) test 
(Limulus Amebocyte Lysate Pyrochrome ,N1 . Associates of Cape C od. inc. Woods Hole. MA), 
lo measure endotoxin levels in preparations of recombinant proteins. 0.5 ml of a solution 
comprising 0.5 mg of purified recombinant protein in 50 mM NaP() 4 . pH 7.0. 0.3M NaCI and 
10% glycerol is used in the LAL assay according to the manufacturer's instructions for the 
endpoini chromogenic without diazo-coupling method [the specific components of the buffer 
containing recombinant protein to be analyzed in the LAL test are not important: any buffer 
having a neutral pi I may be employed (see for example, alternative buffers employed in 
Lxamples 34. 40 and 45) |. Compositions containing less than or equal to than 250 endotoxin 
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units (EU)/mg of purified recombinant protein are herein defined as "substantially endotoxin- 
frec." Preferably the composition contains less than or equal to 100. and most preferably less 
than or equal to 60. (KUVmg of purified recombinant protein. Typically, administration of 
bacterial toxins or toxoids to adult humans for the purpose of vaccination involves doses of 
about 10-500 ug protein/dose. Therefore, administration of 10-500 ug of a purified 
recombinant protein to a 70 kg human, wherein said purified recombinant protein preparation 
contains 60 FU/mg protein, results in the introduction of only 0.6 to 30 VAJ {i.e.. o.2 to 8.6% 
of the maximum allowable endotoxin burden per parenteral dose). Administration of 10-500 
MS of a purified recombinant protein to a 70 kg human, wherein said purified recombinant 
protein preparation contains 250 HU/mg protein, results in the introduction of only 2.5 to 125 
i:U (i.e.. 0.7 to 36% of the maximum allowable endotoxin burden per parenteral dose). 

The I.AL test is accepted by the U.S. FDA as a means of detecting bacterial 
endotoxins (21 C.I .R. §{S 660.10(1 -105). Studies have shown tha, «he LAI. ,es, is equivalent 
or superior lo the USI» rabbit pyrogen lest for the detection of endotoxin and thus the 1.A1. 
test can be used as a surrogate for pyrogenicity studies in animals |F.C. I'erason. /'yw^m: 
ouhioxms. Lll. wstin K ami dvpyro K ci union. Marcel Dekker. New York (I0H5). pp. 150- 155 1. 
The FDA Bureau of Biologies accepts .he LAL assay in place of the USP rabbit pyrogen test 
so long as Hie I.AL assay utilized is shown to be as sensitive as. or more sensitive as the 
rabbit test |Fed. Reg.. 38. 26130 (1980)1. 

The term "monovalent" when used in reference to a clostridial vaccine refers to a 
vaccine which is capable of provoking an immune response in a host animal directed against a 
single type of clostridial toxin. For example, if immunization of a host with ( hntulLm 
type A toxin vaccine induces antibodies in the immunized host which protect against a 
challenge with type A toxin but not against challenge with type H. (.. D. [.. F or (i toxins, 
then the type A vaccine is said to be monovalent. In contrast, a "multivalent" vaccine 
provokes an immune response in a host animal directed against several (/.,.. more than one) 
clostridial toxins. For example, if immunization of a host with a vaccine comprising C. 
homlimim type A and B toxins induces the production of antibodies which protect the host 
against a challenge with both type A and B toxin, the vaccine is said to be multivalent (in 
particular, this hypothetical vaccine is bivalent). 
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As used herein the term "immunogenics! ly-effective amount" refers to that amount of 
an immimogen required to invoke the production of protective levels of antibodies in a host 
upon vaccination. 

The term "protective lever', when used in reference to the level of antibodies induced 
upon immunization of the host with an immunogen which comprises a bacterial toxin, means 
a level of circulating antibodies sufficient to protect the host from challenge with a lethal dose 
of the toxin. 

As used herein the terms "protein" and "polypeptide" refer to compounds comprising 
amino acids joined via peptide bonds and are used interchangeably. 

The terms "toxin" and "neurotoxin" when used in reference to toxins produced by 
memhers {i.e.. species and strains) of the genus Clostridium are used interchangeably and 
refer to the proteins which are poisonous to nerve tissue. 

I he term "receptor-binding domain" when used in reference to a ('. botulinum toxin 
refers to the carboxy-tcrminal portion of the heavy chain (H ( . or the C fragment) of the toxin 
which is presumed to be responsible for the binding of the active toxin (i.e.. the derivative 
toxin comprising the H and I. chains joined via disulfide bonds) to receptors on the surface of 
synaptosomes. The receptor-binding domain for ('. hotulinum type A toxin is dcimc^ herein 
as comprising amino acid residues 861 through 12% of SEQ II) NO:2X. The receptor- 
binding domain for (*. hotulinum type 13 toxin is defined herein as comprising amino acid 
residues 84X through 1291 of SEQ ID NO:40 (strain Eklund I7B). The receptor-binding 
domain of ( * hotulinum type CI toxin is defined herein as comprising amino acid residues 
856 through 1291 of SEQ ID N():60. The receptor-binding domain of C. hotulinum type D 
toxin is ik\}nL\\ herein as comprising amino acid residues 852 through 1276 of SEQ ID 
NO:66. The receptor-binding domain of C hotulinum type E toxin is defined herein as 
comprising amino acid residues 835 through 1250 of SEQ ID NO:50 (Beluga strain). The 
receptor-binding domain of C\ hotulinum type F toxin is defined herein as comprising amino 
acid residues 853 through 1274 of SEQ ID NO:7I. The receptor-binding domain of C. 
hotulinum type (i toxin is defined herein as comprising amino acid residues 853 through I2 ( )7 
oJ SEQ ID NO:77. Within a given serotype, small variations in the primary amino acid 
sequence of the botulinal toxins isolated from different strains has been reported [Whelan et 
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al. (1992). supra and Minton (1995) Curr. Top. Microbiol. Immunol. 195:161-194]. The 
present invention contemplates fusion proteins comprising the receptor-binding domain of C. 
hotulhmm toxins from serotypes A-G including the variants found among different strains 
within a given serotype. The receptor-binding domains listed above are used as the prototype 
tor each strain within a serotype. Fusion proteins containing an analogous region from a 
strain other than the prototype strain are encompassed by the present invention. 

fusion proteins comprising the receptor binding domain Uc. t fragment) of botulinal 
toxins may include amino acid residues located beyond the termini of the domains defined 
above, f or example, the pHisBolB protein contains amino acid residues 846-1291 ofSI-Q ID 
NO.-40: this fusion protein thus comprises the receptor-binding domain lor ( hoiulinum type 
B toxin as defined above [i.e.. lle-848 through Glu-1291 ). Similarly. pHisBotll contains 
amino acid residues 827-1252 of SI-Q ID NO:50 and pilisBoKi contains amino acid residues 
851-1207 ofSr-0 ID N():77. Thus, both pHisHotl- and pHisBotG fusion proteins contain a 
feu amino acids located beyond the N-terminus of the defined receptor-binding domain. 

flic terms "native gene" or "native gene sequences" are used to indicate DNA 
sequences encoding a particular gene which contain the same DNA sequences as found in the 
gene as isolated from nature. In contrast, "synthetic gene sequences" are [)NA sequences 
which are used to replace the naturally occurring DNA sequences when the naturally 
occurring sequences cause expression problems in a given host cell, for example, naturally- 
occurring DNA sequences encoding codons which are rarely used in a host cell may be 
replaced ((:». by site-directed mutagenesis) such that the synthetic DNA sequence represents 
a more frequently used codon. The native DNA sequence and the synthetic DNA sequence 
will preferably encode the same amino acid sequence. 

SUMMARY OF THE INVENTION 

The present invention relates to the production of polypeptides derived from toxins 
particularly in recombinant host cells. In one embodiment, the present invention provides a 
host cell containing a recombinant expression vector, said vector encoding a protein 
comprising at least a portion of a Clostridium hoiulinum toxin, said to.xin selected from the 
group consisting of type B toxin and type E toxin. The present invention is not limited by 
the nature of sequences encoding portions of the C. hoiulinum toxin. These sequences may be 
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derived from the native gene sequences or alternatively they may comprise synthetic gene 
sequences. Synthetic gene sequences are employed when expression of the native gene 
sequences is problematic in u given host cell (e.g.. when the native gene sequences contain 
sequences resembling yeast transcription termination signals and the desired host cell is a 
yeast cell). 

In one embodiment, the host cell is capable of expressing the recombinant C. 
homlimmi toxin protein at a level greater than or equal to 2% to 40% of the total cellular 
protein and preferably at a level greater than or equal to 5% of the total cellular protein. In 
another embodiment, the host cell is capable of expressing the recombinant ( homlimmi 
toxin protein as a soluble protein at a level greater than or equal to 0.25% of the total cellular 
protein and preferably at a level greater than or equal to 0.25% to 10% of the total cellular 
protein. 

The present invention is not limited by the nature of the host cell employed for the 
production of recombinant C hoiulinum toxin proteins. In a preferred embodiment, the host 
cell is an /•;. cali cell. In another preferred embodiment, the host cell is an insect cell: 
particularly preferred insect host cells are Spodoptcrc, fru^ipurdci (SfV) cells. In another 
preferred embodiment, the host cell is a yeast cell: particularly preferred yeast cells are Pichiu 
pustoris cells. 

In another embodiment, the invention provides a host cell containing a recombinant 
expression vector, said vector encoding a fusion protein comprising a non-toxin protein 
sequence and at least a portion of a Clostridium homlimmi toxin, said toxin selected from the 
group consisting of type 13 toxin and type f£ toxin. The invention is not limited by the nature 
ol ihe portion of the Clostridium bom/imm, toxin selected. In a preferred embodiment, the 
portion of the toxin comprises the receptor binding domain (/>.. the C fragment of the toxin,. 
The present invention is not limited by the nature of the non-toxin protein sequence 
employed. In a preferred embodiment, the non-toxin protein sequence comprises a poly- 
histidine tract. A number of alternative fusion tags or fusion partners are known to the art 
{ex.. MUI\ (1ST. protein A. etc.) and may be employed lor the production of fusion proteins 
comprising a portion of a boiulinal toxin. 
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The present invention further provides a vaccine comprising a fusion protein, said 
I usion protein comprising a non-toxin protein sequence and at least a portion of a Clostridium 
hofulinum toxin, said toxin selected from the group consisting of type B toxin and type H 
toxin. The vaccine may be a monovalent vaccine (i.e.. containing only a toxin B fusion 
protein or a toxin E fusion protein), a bivalent vaccine [i.e.. containing both a toxin B fusion 
protein and a toxin E fusion protein) or a trivalent or higher valency vaccine. In a preferred 
embodiment, the toxin B fusion protein and/or toxin li fusion protein is combined with a 
fusion protein comprising a non-toxin protein sequence and al least a portion of Clostridium 
hofulinum type A toxin. The present invention is not limited by the nature of the portion of 
the Clostridium hoiulinum toxin selected. In a preferred embodiment, the portion of the toxin 
comprises the receptor binding domain (i.e.. the C fragment of the toxin). The present 
invention is not limited by the nature of the non-toxin protein sequence employed. In a 
preferred embodiment, the non-toxin protein sequence comprises a poly-histidine tract. A 
number ol alternative fusion tags or fusion partners are known to the art {e.ii.. MBI\ GST. 
\* protein A. etc.) and may be employed for the generation of fusion proteins comprising 

vaccines. When a fusion partner </>.. the non-toxin protein sequence) is employed for the 
production of a recombinant ( hotulinul toxin protein, the fusion partner may be removed 
I mm the recombinant hotulinul toxin protein if desired Ha'., prior to administration of the 
protein to a subject) using a variety of methods known to the art (c.^.. digestion of fusion 
20 proteins containing FactorXa or thrombin recognition sites with the appropriate enzyme). A 
number of the pLTHis vectors employed herein provide an N-terminal his-iag followed by a 
FactorXa cleavage site (see Example 2Ka): the hotulinul C fragment sequences follow the 
FactorXa site and thus. FactorXa can be used to remove the his-lag from the botulinal fusion 
protein. In a preferred embodiment, the vaccine is substantially endotoxin-free. 
- D 1 he present invention is not limited by the method employed for the generation of 

vaccine comprising fusion proteins comprising a non-toxin protein sequence and at least a 
portion of a Clostridium hofulinum toxin. The fusion proteins may be produced by 
recombinant DNA means using either native or synthetic gene sequences expressed in a host 
cell fhe present invention is not limited to the production of vaccines using recombinant 
host cells: cell free in vitro transcription/translation systems may be employed for the 
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expression of the nucleic acid constructs encoding the fusion proteins of the present invention. 
An example of such a cell-free system is the commercially available TnT™ Coupled 
Reticulocyte Lysate System (Promega Corporation. Madison. Wl). Alternatively, the fusion 
proteins of the present invention may be generated by synthetic means (i.e.. peptide 
synthesis). 

The present invention further provides a method of generating antibody directed 
against a Clostridium hoiulhum, toxin comprising: a) providing in any order: i) an antigen 
comprising a fusion protein comprising a non-toxin protein sequence and at least a portion of 
a Closiridium hoiulinum toxin, said toxin selected from the group consisting of type 13 toxin 
and type L toxin, and ii) a host: and b) immunizing the host with the antigen so as to 
generate an antibody. In a preferred embodiment, the antigen used to immunize the host also 
contains a fusion protein comprising a non-toxin protein sequence and at least a portion of 
( -losnklium hmulinum type A toxin. The present invention is no. limited by the nature of the 
portion of the Clostridium hoiulinum toxin selected. In a preferred embodiment, the portion 
of the toxin comprises the receptor binding domain (i.e.. the C fragment of the toxin). The 
present invention is not limited by the nature of the non-toxin protein sequence employed. In 
a preferred embodiment, the non-toxin protein sequence comprises a poly-histidine tract. A 
number of alternative fusion tags or fusion partners arc known to the art (e.^.. MB P. GST. 
protein A. etc.) and may be employed for the generation of fusion proteins comprising 
vaccines. When a fusion partner (i.e.. the non-toxin protein sequence) is employed for the 
production of a recombinant C hoiulinal toxin protein, the fusion partner may be removed 
Iron, the recombinant C hoiulinal toxin protein if desired (, prior to administration of the 
protein to a subject) using a variety of methods known to the art (e. K .. digestion of fusion 
proteins containing FuctorXa or thrombin recognition .sites with the appropriate enzyme). 

The present invention is not limited by the nature of the host employed for the 
production of the antibodies of the invention. In a preferred embodiment, the host is a 
mammal, preferably a human. The antibodies of the present invention may be generated 
using non-mammalian hosts such as birds, preferably chickens. In a preferred embodiment 
the method of the present invention further comprised the step c. of collecting the antibodies 
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from the host. In. yet another embodiment, the method of the present invention further 
comprises the step d) of purifying the antibodies. 

The present invention further provides antibodies raised according to the above 
methods. 

The present invention further contemplates multivalent vaccines comprising at least 
two recombinant ('. hoiulinum toxin proteins derived from the group consisting of C. 
hotulinum serotypes A. B. C D. E. F. and G. The invention contemplates bivalent, trivalent. 
quadravalem. pentavalent. heptavalent and septivalent vaccines comprising recombinant ( 
hoiulinum toxin proteins. Preferably the recombinant C hoiulinum toxin protein comprises 
the receptor binding domain (i.e.. C fragment) of the toxin. 

DESCRIPTION OF THE INVENTION 

I he present invention contemplates vaccinating humans and other animals with 
polypeptides derived from C hoiulinum neurotoxins which are substantially endotoxin-free. 
Ihese hotulinal peptides are also useful for the production of antitoxin. Anti-hotulinal toxin 
antitoxin is useful for the treatment of patients effected by or at risk of symptoms due to the 
action of C. hoiulimtm toxins. The organisms, toxins and individual steps of the present 
invention are described separately below. 

I. Clostridium Species, Clostridial Diseases And Associated Toxins 

A preferred embodiment of the method of the present invention is directed toward 
obtaining anlihodies against Clostridium species, their toxins, enzymes or other metabolic by- 
products, cell wall components, or synthetic or recombinant versions of any of these 
compounds. It is contemplated that these antibodies will be produced by immunization of" 
humans or other animals. It is not intended that the present invention be limited to any 
particular toxin or any species of organism. In one embodiment, toxins from all Clostridium 
species arc contemplated as immunogens. Examples of these toxins include the neuraminidase 
toxin of C hu/yricum. C sordcltii toxins I I I* and l/f. toxins A. B. (/. I). I.-. I", and Ci of C 
hotulimim and the numerous ( * perfrin^ens toxins. In one preferred embodiment, toxins A. 
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B. and R of C bondmum are contemplated as immunogens. Table 2 above lists various 
Clostridium species, their toxins and some antigens associated with disease. 



TABLE 2 

Clostridial Toxins 


Organism 


Toxins and Disease- Associated Ant it: ens 




A. B. C, C\, D. L F. G 


( '. httfvricttut 


Neuraminidase 


('. tliflkilc 


A. B. F.nterotoxin (not A nor B). Motility Altering Factor. Lou 
Molecular Weight Toxin. Others 


t '. /v/ frtn^cns 


«. p. c. i. y. o. v. G. k\ I, u. u 


( htfcrmcnians 


HT. IT. a. (J. y 


( '. IHlYVI 


«. |i. y. 6. c. C. v. 0 


( ' svptwum 


a. p. y. o 


( '. hisiolytu imi 


u. {y y. i\ t plus additional ciuvmes 




a. |L y. tS 



It is not intended that antibodies produced against one toxin will only be used against 
that toxin, h is contemplated that antibodies directed against one toxin u-. K .. ( \ perjrin^ns 
type A entcrotoxin) may he used as an effective therapeutic against one or more toxin(s) 
produced by other members of the genus Clostridium or other toxin producing organisms 

Bacillus ccrcus. Sicphylncccus aureus. Streptococcus mulans. Acinelolmacr 
culcoacciicus. I'sciuhnmnas acn^innsu. other Pseudomnnus species, etc.). It is further 
contemplated that antibodies directed against the portion of the toxin which binds to 
mammalian membranes w.f!.. ( ; pcrfrinacm entero.oxin A) can also be used against other 
organisms. It is contemplated that these membrane binding domains are produced 
synthetically and used as immunouens. 



H. Obtaining Antibodies In Non-Mammals 

A preferred embodiment of the method of the present invention for obtaining 
antibodies involves immunization. However, it is also contemplated that antibodies could be 
obtained from non-mammals without immunization. In the case where no immunization is 
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contemplated, the present invention may use non-mammals with preexisting antibodies to 
toxins as well as non-mammals that have antibodies to whole organisms by virtue ol* reactions 
with the administered antigen. An example of the latter involves immunization with synthetic 
peptides or recombinant proteins sharing epitopes with whole organism components. 

In a preferred embodiment, the method of the present invention contemplates 
immunizing non-mammals with bacterial toxin(s). It is not intended that the present invention 
be limited to any particular toxin. In one embodiment, toxin from ail clostridial bacteria 
sources i.wc Table 2) are contemplated as immunogens. Examples of these toxins are C. 
hutyrintm neuraminidase toxin, toxins A. B. C. D. F.. F. and Ci from ( hotulinum . 
(\ perfrinxens toxins u. (*. i:, and t. and C. sordellii toxins I IT and IT. In a preferred 
embodiment. C. hotulinum toxins A. B. C. 1). E. and F (or fragments thereof) are 
contemplated as immunogens. 

A particularly preferred embodiment involves the use of bacterial toxin protein or 
Iragments of toxin proteins produced by molecular biological means [i.e.. recombinant toxin 
L> proteins). In a preferred embodiment, the immunogen comprises the receptor-hindinu domain 
{i.e.. the - so kl) carbox> -terminal portion of the heavy chain: also referred to us the C 
fragment) of C. hotulinum serotype A neurotoxin produced by recombinant DNA technology. 
In another preferred embodiment, the immunogen comprises the receptor-binding domain of 
C. hotulinum serotype B neurotoxin produced by recombinant DNA technology. In yet 
20 another preferred embodiment, the immunogen comprises the receptor-binding domain region 
ol r. hotulinum serotype E neurotoxin produced by recombinant DNA technology. In yet 
another prelerred embodiment, the immunogen comprises the receptor-binding domain reuion 
of hotulinum serotype CI neurotoxin produced by recombinant DNA technology. In yet 
another preferred embodiment, the immunogen comprises the receptor-binding domain reuion 
2? of ('. hotulinum serotype C2 neurotoxin produced by recombinant DNA technology. In yet 
another preferred embodiment, the immunogen comprises the receptor-binding domain region 
off. hotulinum serotype D neurotoxin produced by recombinant DNA technology. In yet 
another preferred embodiment, the immunogen comprises the receptor-binding domain region 
ol ('. hotulinum serotype F neurotoxin produced by recombinant DNA technology. In yet 
30 another preferred embodiment, the immunogen comprises the receptor-binding domain reuion 
of C. hotulinum serotype (i neurotoxin produced by recombinant DNA technology. In a 
preferred embodiment, the recombinant botulinal toxin proteins are expressed as fusion 
proteins U'.tf.. as histidine-lagged proteins). In a still further preferred embodiment, the 
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immunogen is a multivalent vaccine comprising the receptor-binding domain region of C. 
hoiulhmm toxin from two or more toxins selected from the group consisting of type A. type 
B. type C (including CI and C2). type D. type F. and type F toxin. 

When immunization is used, the preferred non-mammal is from the class Aves. All 
birds are contemplated {e. K .. duck, ostrich, emu. turkey, etc.). A preferred bird is a chicken. 
Imponanily. chicken antibody does not fix mammalian complement. \See H.N. Benson el a/., 
J. Immunol. 87:616 ( 1% I ).| Thus, chicken antibody will normally not cause a complement- 
dependent reaction. (A. A. Benedict and K. Yamaga. "ImmunoKhhulins ami Antibody 
I'nuluciion in Avian Species. " in ( umparaliw Immunology (J.J. Marchaloni. cd.). pp. 335- 
375. Bluckwell. Oxford (1%6).J Thus, the preferred antitoxins of the present invention will 
not exhibit complement-related side effects observed with antitoxins known presently. 

When birds are used, it is contemplated that the antibody will be obtained from either 
the bird serum or the egg. A preferred embodiment involves collection of the antibody from 
the egg. Laying hens transport immunoglobulin to the egg yolk ("IgY") in concentrations 
equal to or exceeding that found in serum. \See R. Patterson el ai. J. Immunol. 89:272 
(1962); and S.I3. Carroll and B.D. Stollar. J. Biol. Chem. 258:24 (1983).] In addition, the 
large volume of egg yolk produced vastly exceeds the volume of scrum that can be safely 
obtained from the bird over any given time period. Finally, the antibody from eggs is purer 
and more homogeneous: there is far less non-immunoglobulin protein (as compared to scrum) 
and only one class of immunoglobulin is transported to the yolk. 

When considering immunization with toxins, one may consider modification of the 
toxins to reduce the toxicity. In this regard, it is not intended that the present invention be 
limited by immunization with modified toxin. Unmodified ("native") toxin is also 
contemplated as an immunogen. 

It is also not intended that the present invention be limited by the type of modification 
- if modification is used. The present invention contemplates all types of toxin modification, 
including chemical and heal treatment of the toxin. The preferred modification, however, is 
formaldehyde treatment. 

It is not intended that the present invention be limited to a particular mode of 
immunization: the present invention contemplates all modes of immunization, including 
subcutaneous, intramuscular, intraperitoneal, and intravenous or intravascular injection, as well 
as per as administration of immunogen. 
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The present invention further contemplates immunization with or without adjuvant. 
(Adjuvant is defined as a substance known to increase the immune response to other antigens 
when administered with other antigens.) If adjuvant is used, it is not intended that the present 
invention be limited to any particular type of adjuvant - or that the same adjuvant, once used, 
be used all the time. While the present invention contemplates all types of adjuvant, whether 
used separately or in combinations, the preferred use of adjuvant is the use of Complete 
Freund's Adjuvant followed sometime later with Incomplete Freund's Adjuvant. Another 
preferred use of adjuvant is the use of Gerbu Adjuvant. The invention also contemplates the 
use of RIBI fowl adjuvant and Qui I A adjuvant. 

When immunization is used, the present invention contemplates a wide variety of 
immunization schedules. In one embodiment, a chicken is administered toxin(s) on day zero 
and subsequently receives toxin(s) in intervals thereafter. It is not intended that the present 
invention be limited by the particular intervals or doses. Similarly, it is not intended that the 
present invention be limited to any particular schedule for collecting antibodv. The preferred 
collection time is sometime after day 100. 

Where birds are used and collection of antibody is performed by collecting egiis. the 
eggs may be stored prior to processing for antibody. It is preferred that eggs be stored at 4°C 
for less than one year. 

It is contemplated that chicken antibody produced in this manner can be buffer- 
extracted and used analytically. While unpurificd. this preparation can serve as a reference 
lor activity of the antibody prior to further manipulations U\u.. immunoaffmity purification). 

III. Increasing The Effectiveness Of Antibodies 

When purification is used, the present invention contemplates purifying to increase the 
elfectiveness of both non-mammalian antitoxins and mammalian antitoxins. Specifically, the 
present invention contemplates increasing the percent of toxin-reactive immunoglobulin. The 
preferred purification approach for avian antibody is polyethylene glycol (Pl'-Ci) separation. 

I he present invention contemplates that avian antibodv be initially purified using 
simple, inexpensive procedures. In one embodiment, chicken antibody from eggs is purified 
by extraction and precipitation with PEG. PUG purification exploits the differential solubility 
of lipids (which are abundant in egg yolks) and yolk proteins in high concentrations of PUG 
8000. [Poison et <//.. Immunol. Comm. 9:495 (19K0).] The technique is rapid, simple, and 
relatively inexpensive and yields an immunoglobulin fraction that is significantly purer in 
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terms of contaminating non-immunoglobulin proteins than the comparable ammonium sulfate 
fractions of mammalian sera and horse antibodies. The majority of the PEG is removed from 
the precipitated chicken immunoglobulin by treatment with ethanol. Indeed. PEG-purified 
antibody is sufficiently pure that the present invention contemplates the use of PEG-purified 
antitoxins in the passive immunization of intoxicated humans and animals. 

IV. Treatment 

The present invention contemplates antitoxin therapy for humans and other animals 
intoxicated by bacterial toxins. A preferred method of treatment is by intravenous 
administration of anti-boutlinal antitoxin: oral administration is also contemplated for other 
clostridial antitoxins. 



A. Dosage Of Antitoxin 

h was noted by way of background that a balance must be struck when administering 
currently available antitoxin which is usually produced in large animals such as horses: 
sufficient antitoxin must be administered to neutralize the toxin, but not so much antitoxin as 
to increase the risk of untoward side effects. These side effects are caused by: i) patient 
sensitivity to foreign {e. K . horse) proteins: ii) anaphylactic or immunogenic properties of non- 
immunoglobulin proteins: iii) the complement fixing properties of mammalian antibodies: 
and/or iv) the overall burden of foreign protein administered. It is extremely difficult to 
strike this balance when, as noted above, the degree of intoxication (and hence the level of 
antitoxin therapy needed) can only be approximated. 

I he present invention contemplates significantly reducing side effects so that this 
balance is more easily achieved. Treatment according to the present invention contemplates 
reducing side effects by using PEG-purilied antitoxin from birds. 

In one embodiment, the treatment of the present invention contemplates the use of 
PIXi-purilled antitoxin from birds. The use of yolk-derivcd. PEG-purified antibody as 
antitoxin allows for the administration of: I ) non( mammalian )-complemcni-fi.\ing. avian 
antibody: 2) a less heterogeneous mixture of non-immunoglobulin proteins: and 3) less total 
protein to deliver the equivalent weight of active antibody present in currently available 
antitoxins. The non-mammalian source of the antitoxin makes it useful for treating patients 
who are sensitive to horse or other mammalian sera. 
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B. Delivery Of Antitoxin 

Although it is not intended to limit the route of delivery, the present invention 
contemplates a method for antitoxin treatment of bacterial intoxication in which delivery of 
antitoxin is oral. In one embodiment, antitoxin is delivered in a solid form (t'.tf.. tablets). In 
an alternative embodiment antitoxin is delivered in an aqueous solution. When an aqueous 
solution is used, the solution has sufficient ionic strength to soiubilize antibody protein, yet is 
made palatable for oral administration. The delivery solution may also be buffered (e.g.. 
carbonate buffer pH 9.5) which can neutralize stomach acids and stabilize the antibodies when 
the antibodies are administered orally. In one embodiment the delivery solution is an aqueous 
solution. In another embodiment the delivery solution is a nutritional formula. Preferably, 
the delivery solution is infant formula. Yet another embodiment contemplates the delivery of 
lyophilized antibody encapsulated or microencapsulated inside acid-resistant compounds. 

Methods of applying enteric coatings to pharmaceutical compounds are well known to 
the an [companies specializing in the coating of pharmaceutical compounds are available: for 
example. The Coating Place (Verona. WI) and AAI (Wilmington. NC)|. linterie coatings 
which are resistant to gastric fluid and whose release (/>.. dissolution of the coating u> release 
the pharmaceutical compound) is pH dependent are commercially available | lor example, the 
polymcthacryiates f- tidragitH L and KudragituO S (Rohm GmbH)f l-udragit'K; S is soluble in 
intestinal fluid from pi I 7.0: this coaling can be used to microencapsulate lyophilized antitoxin 
antibodies and the particles are suspended in a solution having a pi I above or below pi J 7.0 
lor oral administration. The microparticles will remain intact and undissolved until they 
reached the intestines where the intestinal pH would cause them to dissolve thereby releasing 
the antitoxin. 

The invention contemplates a method of treatment which can be administered for 
treatment of acute intoxication. In one embodiment, antitoxin is administered orallv in either 
a delivery solution or in tablet form, in therapeutic dosage, to a subject intoxicated by the 
bacterial toxin which served as immunogen for the antitoxin, 

The invention also contemplates a method of treatment which can be administered 
prophylactieally. In one embodiment, antitoxin is administered orally, in a delivery solution, 
in therapeutic dosage, to a subject, to prevent intoxication of the subject by the bacterial toxin 
which served as immunogen for the production of antitoxin. In another embodiment, 
antitoxin is administered orally in solid form such as tablets or as microencapsulated particles. 
Microencapsulation of lyophilized antibody using compounds such as Ludragit® (Rohm 
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GmbH) or polyethylene glycol . which dissolve al a wide range of pH units, allows the oral 
administration of solid antitoxin in a liquid form </.<?.. a suspension) to recipients unable to 
tolerate administration of tablets (e.g.. children or patients on feeding tubes). In one preferred 
embodiment the subject is a child. In another embodiment, antibody raised against whole 
bacterial organism is administered orally to a subject, in a delivery solution, in therapeutic 
dosage. 

V. Vaccines Against Clostridial Species 

The invention contemplates the generation of mono- and multivalent vaccines for the 
protection of an animal (particularly humans) against several clostridial species. Of particular 
interest are vaccines which stimulate the production of a humoral immune response to C. 
hnndimm, ( ' te iani unci C difficile in humans. The antigens comprising the vaccine 
preparation may he native or recombinant^ produced toxin proteins from the clostridial 
species listed above. When toxin proteins are used as immunogens they arc generally 
modified to reduce the toxicity. This modification may be by chemical or genetic [i.e.. 
recombinant DNA technology, means. In general genetic detoxification (i.e.. the expression 
of nontoxic fragments in a host cell) is preferred as the expression of nontoxic fragments in a 
host cell precludes the presence of intact, active toxin in the final preparation. However, 
when chemical modification is desired, the preferred toxin modification is formaldehyde 
treatment. 

The invention contemplates that recombinant C hoiulinum toxin proteins be used as 
antigens in mono- and multivalent vaccine preparations. Soluble, substantially endo toxin-free 
recombinant ('. hnndinwn toxin proteins derived from serotypes A. B and E may be used 
individually (i.e.. as mono-valent vaccines) or in combination Ut:. as a multi-valent vaccine). 
In addition, the recombinant ('. hondinum toxin proteins derived from serotpes A. 13 and F. 
may be used in conjunction with either recombinant or native toxins or toxoids from other 
serotypes of C. hondinum. C. difficile and C. teUmi as antigens for the preparation of these 
mono- and multivalent vaccines. I. is contemplated that, due to the structural similarity of C. 
luuulinum and C. ,e,ani toxin proteins, a vaccine comprising C. difficile and hondimm, toxin 
proteins (native or recombinant or a mixture thereof) be used to stimulate an immune 
response against C. botulinum, C. tviuni and C. difficile. 
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The present invention further contemplates multi-vaient vaccines comprising two or 
more botulinal toxin proteins selected from the group comprising recombinant C hotulinum 
toxin proteins derived from serotypes A. B. C (including CI and C2). D. E. F and G. 

The adverse consequences of exposure to botulinal toxin would be avoided by 
immunization of subjects at risk of exposure to the toxin with nontoxic preparations which 
confer immunity such as chemically or genetically detoxified toxin. 

Vaccines which confer immunity against one or more of the toxin types A. B. F. F 
and G would he useful as a means of protecting humans from the deleterious effects of those 
(' hotulinum toxins known to affect man. Indeed as the possibility exists that humans could 
be exposed to any of the seven serotypes of C hotulinum toxin (c.#.. during biological 
warfare or the production of toxin in a laboratory setting), multivalent vaccines capable of 
conferring immunity against toxin types A-G (including both CI and C2 toxins) would be 
useful for the protection of humans. Vaccines which confer immunity against one or more of 
the toxin types C. D and E would be useful for veterinary applications. 

The botulinal neurotoxin is synthesized as a single polypeptide chain which is 
processed into a heavy (H: -100 kD) and a light (L; -50 kD) chain by cleavage with proteolytic 
enzymes: these two chains are held together via disulfide bonds in the active toxin (referred to 
as derivative toxin) |B.R. DasGupta and II. Sugiyama. Biochcm, Biophys. Res. Commun. 
48:108 (1972): reviewed in B.R. DasGupta. J. Physiol. 84:220 (1990). H. Sugiyama. 
Microbiol. Rev. 44:419 (1980) and C.L.. Hatheway. Clin. Microbiol. Rev. 3:66 (1990)). The 
heavy chain of the active toxin is cleaved by trypsin to produce two fragments termed H ( 
(also referred to as 1-1, or C) and II N (also referred to as \l : or B). The H ( . fragment (-46 kD) 
comprises the carboxy end of the II chain. The H N . fragment (-49 kD) comprises the ammo 
end and remains attached to the L chain (H N L). Neither H c or H s .l. is toxic. \\. competes 
with whole derivative toxin for binding to synaptosomes and therefore 1I ( is said to contain 
the receptor binding site. The II ( . and H N fragments of botulinal toxin are analogous to the 
fragments C and B of tetanus toxin which are produced by papain cleavage. The C fragment 
of tetanus toxin has been shown to be responsible for the binding of tetanus toxin to purified 
gangliosides and neuronal cells |Malpern and Loftus, J. Biol. Chem. 288: M 188 (1993)]. 

Antisera raised against purified preparations of isolated botulinal H and L chains have 
been shown to protect mice against the lethal effects of the toxin: however, the effectiveness 
of the two antisera differ with the anti-H sera being more potent (II. Sugiyama. supra). 
While the different botulinal toxins show structural similarity to one another, the different 
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serotypes are reported to be immunologically distinct (i.e.* sera raised against one toxin type 
does not cross-react to a significant degree with other types). Thus, the generation of 
multivalent vaccines may require the use of more than one type of toxin. 

(' hotulinum toxin genes from all seven serotypes have been cloned and sequenced 
(Mimon (1995). supra): in addition, partial amino acid sequence is available for a number of 
C. hotulinum toxins isolated from different strains within a given serotype. The C. hotulinum 
toxins contain about 1250-1300 amino acid residues. On the DNA level, the overall degree of 
homology between C hotulinum serotypes A, B. C. D and K toxins averages between 50 and 
M)% identity with a greater degree of homology being found between H chain-encoding 
regions than between those encoding L chains [Whelan et at. (1992) Appl. Environ. 
Microbiol. 58:2345). The degree of identity between C. hotulinum toxins on the amino acid 
level reflects the level of DNA sequence homology. The most divergent area of DNA and 
amino acid sequence is found within the carboxy-terminal area of the various C\ hotulinum II 
chain genes. This portion of the toxin (i.e.. H r or the C fragment) plays a major role in cell 
binding. As toxin from different serotypes is thought to bind to distinct cell receptor 
molecules, it is not surprising that the toxins diverge significantly over this region. 

Within a given serotype, small variations in the primary amino acid sequence of the 
botulinal toxins isolated from different strains has been reported | Whelan et at. (1992). supra 
and Minion (1995), supra]. The present invention contemplates fusion proteins comprising 
portions of C. hotulinum toxins from serotypes A-G including the variants found among 
different strains within a given serotype. The present invention provides oligonucleotide 
primers which may be used to amplify the C fragment or receptor-binding region of the toxin 
gene from various strains of C. hotulinum serotype A. serotype B. serotvpe C (CI and C2). 
serotype I), serotype E. serotype I and serotype G. A large number of different strains off. 
hotulinum serotype A. serotype B. serotype C serotype D serotype E and serotype F are 
available from the American Type Culture Collection (ATCC: Rockville. MD). For example, 
the ATCC provides the following: Type A strains: 174 (ATCC 3502), 457 (ATCC 17862). 
and NCTC 7272 (ATCC 19397): Type B strains: 34 (ATCC 439). 62A (ATCC 7948). NCA 
213 B (ATCC 7949). 13114 (ATCC 8083). 3137 (ATCC 17780). 1347 (ATCC 17841). 2017 
(ATCC 17843). 2217 (ATCC 17844). 2254 (ATCC 17845) and VP 1731 (ATCC 25765): 
Type C strains: 2220 (ATCC 17782). 2239 (ATCC 17783), 2223 (ATCC 17784: a type C-|* 
strain: C-{\ strains produce C2 toxin). 662 (ATCC 17849; a type C-cc strain; C-a strains 
produce mainly CI toxin and a small amount of C2 toxin). 2021 (ATCC 17850: a type C-a 
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strain) and VP1 3803 (ATCC 25766); Type D strains: ATCC 9633. 2023 (ATCC 17851), and 
VPI 5995 (ATCC 27517); Type li strains: ATCC 43181. 36208 (ATCC 9564), 2231 (ATCC 
17786). 2229 (ATCC 17852). 2279 (ATCC 17854) and 2285 (ATCC 17855) and Type F 
strains: 202F (ATCC 23387). VPI 4404 (ATCC 25764). VPI 2382 (ATCC 27321) and 
Langeland (ATCC 35415). Type G strain. 1 13/30 (NCFB 3012) may be obtained from the 
National Collection of Food Bacteria (NCFB. AFRC Institute of Food Research. Reading, 
United Kingdom). 

Purification methods have been reported for native toxin types A. B. C. D, E. and F 
| reviewed in O. Sakaguchi. Pharmac. Ther. 19:165 (!983)j. As the different botulinal toxins 
are structurally related, the invention contemplates the expression of any of the botulinal 
toxins (<.'.#.. types A-CJ) as soluble recombinant fusion proteins. 

In particular, methods for purification of the type A botulinum neurotoxin have been 
developed [L.J. Moberg and H. Sugiyama. Appl. Hnviron. Microbiol, 35:878 (I978)|. 
Immunization of hens with detoxified purified protein results in the generation of neutralizing 
antibodies |B.S. Thalley et ai. in Botulinum and Tetanus Neurotoxins. B.R. DasGupta. cd,. 
Plenum Press, New York (1993). p. 467]. 

I he currently available (*. hotu/inum pentavalent vaccine comprising chemically 
inactivated </. c \. formaldehyde treated) type A. B. C. D and li toxins is not adequate. The 
efficacy is variable (in particular, only 78% of recipients produce protective levels of anti-type 
B antibodies following administration of the primary scries) and immunization is painful 
(deep subcutaneous inoculation is required for administration), with adverse reactions being 
common (moderate to severe local reactions occur in approximately 6% of recipients upon 
initial injection: this number rises to approximately 1 1% of individuals who receive booster 
injections) (Informational Brochure for the Pentavalent (ABCI)F) Botulinum Toxoid. Centers 
lor Disease Control). Preparation of this vaccine is dangerous as active toxin must be handled 
by laboratory workers. 

In general, chemical detoxification of bacterial toxins using agents such as 
lormaldchyde. glutaraldehyde or hydrogen peroxide is not optimal for the generation of 
vaccines or antitoxins. A delicate balance must be struck between too much and too little 
chemical modification. If the treatment is insufficient, the vaccine may retain residual 
toxicity. If the treatment is too excessive, the vaccine may lose potency due to destruction of 
native immunogenic determinants. Another major limitation of using botulinal toxoids for the 
generation of antitoxins or vaccines is the high production expense. For the above reasons. 
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the development of methods for the production of nontoxic but immunogenic C. hotulmum 
toxin proteins is desirable. 

The C hoiulinum and C. tetanus toxin proteins have similar structures [reviewed in 
F..J. Schantz and E.A. Johnson. Microbiol. Rev. 56:80 ( |992)|. The carboxy-ierminal 50 kD 
fragment of the tetanus toxin heavy chain (fragment C) is released by papain cleavage and has 
been shown to he non-toxic and immunogenic. Recombinant tetanus toxin fragment C has 
been developed as a candidate vaccine antigen [A.J. Makoff et al.. Bio/Technology 7:1043 
(I989)|. Mice immunized with recombinant tetanus toxin fragment C were protected from 
challenge with lethal closes of tetanus toxin. No studies have demonstrated that the 
recombinant tetanus fragment C protein confers immunity against other botulinal toxins such 
as the C. hnmlinum toxins. 

Recombinant tetanus fragment C has been expressed in E. coli (A.J. Makoff et al.. 
Bio/Technology, supra and Nucleic Acids Res. 17:10191 (1989); J.I.. Halpern et al.. Infect. 
Immun. 58:1004 (1990)j. ycasi [M.A. Romanes */ al.. Nucleic Acids Res. 19:1461 (199|)| 
and baculovirus [I.G. Charles e, al.. Infect. Immun. 59:1627 (199l)|. Synthetic tetanus toxin 
genes had to be constructed to facilitate expression in yeast (M.A. Romanos e, at., supra) and 
E. eoli [A.J. Makoff et al.. Nucleic Acids Res., supra), due to the high A-T content of the 
tetanus toxin gene sequences. High A-T content is a common feature of clostridial genes 
(M.R. PopoiTc/ al.. Infect. Immun. 59:3673 (1991); H.F. LaPeno.iere et a/., in Botulinum and 
Tetanus Xewotoxins. B.R. DasGupta. ed.. Plenum Press. New York (1993). p. 463| which 
creates expression difficulties in E. coli and yeast due primarily to altered codon usage 
freijueney and fortuitous polyadem lation sites. respectively. 

The C fragment of the C. hotulmum type A neurotoxin heavy chain has been evaluated 
as a vaccine candidate. The C. hntulinum type A neurotoxin gene has been cloned and 
sequenced [D.H. Thompson c, al.. Eur. J. Biochem. 189:73 (I990)|. The C fragment of the 
type A toxin was expressed as either a fusion protein comprising the botulinal C fragment 
fused with the maltose binding protein (MBP) or as a native protein [H.F. l.aPenotiere et al.. 
(1993) supra. H.F. LaPenotiere e, al.. Toxicon. 33:1383 ( 1995) and Middlebrook and Brown 
(1995). Curr. Top. Microbiol. Immunol. 195:89-1221. The plasmid construct encoding ,he 
native protein was reported to be unstable, while the fusion protein was expressed primarily in 
inclusion bodies as insoluble protein. Immunization of mice with crudely purified MBP 
fusion protein resulted in protection against IP challenge with 3 LD,„ doses of toxin 
U-aPenotiere et al.. (1993) and (.995). supra). However, this recombinant C. hotulmum tvpe 
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A toxin C fragment/MBP fusion protein is not a suitable immunogen for the production of 
vaccines as it is expressed as an insoluble protein in E. coli. Furthermore, this recombinant 
C. hotidinum type A toxin C fragment/MBP fusion protein was not shown to be substantially 
free of endotoxin contamination. Experience with recombinant C. hotidinum type A toxin C 
fragment/MBP fusion proteins shows that the presence of the MBP on the fusion protein 
greatly complicates the removal of endotoxin from preparations of the recombinant fusion 
protein [sec Ex. 24. infra). Expression of a synthetic gene encoding (' hotidinum type A 
toxin C fragment as a soluble protein excreted from insect cells has been reported 
[Middlcbrook and Brown (1995). supra]: no details regarding the level of expression achieved 
or the presence of endotoxin or other pyrogens were provided. Like the insoluble protein 
expressed in E. coli. immunization with the recombinant protein produced in insect cells was 
reported to protect mice from challenge with C hotidinum toxin A. 

Inclusion body protein must be solubilized prior to purification and/or administration 
to a host. The harsh treatment of inclusion body protein needed to accomplish this 
solubilization may reduce the immunogenicity of the purified protein. Ideally, recombinant 
proteins to be used as vaccines are expressed as soluble proteins at high levels (i.e.. greater 
than or equal lo about 0.75% of total cellular protein) in E. coli or other host cells (t\#., 
yeast, insect cells, etc.). This facilitates the production and isolation of sufficient quantities of 
the immunogen in a highly purified form (i.e.. substantially free of endotoxin or other 
pyrogen contamination). The ability to express recombinant toxin proteins as soluble proteins 
in £ coli is advantageous due to the low cost of growth compared to insect or mammalian 
tissue culture cells. 

The ( \ hotidinum type B neurotoxin gene has been cloned and sequenced from two 
strains of ('. hotidinum type B [Whelan et ai (1992) Appl. Environ. Microbiol. 58:2345 
(Danish strain) and Hutson at ai (1994) Curr. Microbiol. 28:101 (Eklund 17B strain)]. The 
nucleotide sequence of the toxin gene derived from the Eklund I7B strain (ATCC 25765) is 
available from the EMBL/GenBank sequence data banks under the accession number X7I343: 
the nucleotide sequence of the coding region is listed in SEQ ID NO:39. The amino acid 
sequence of the C. hotidinum type B neurotoxin derived from the strain Eklund 17B is listed 
in SEQ ID NO:4(). The nucleotide sequence of the hotidinum serotype B toxin gene 
derived from the Danish strain is listed in SEQ ID NO:41. The amino acid sequence of the 
('. hotidinum type B neurotoxin derived from the Danish strain is listed in SEQ ID NO:42. 
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The C. botulinum type B neurotoxin gene is synthesized as a single polypeptide chain 
which is processed to form a dimer composed of a light and a heavy chain linked via 
disulfide bonds. The light chain is responsible for pharmacological activity (i.e.. inhibition of 
the release of acetylcholine at the neuromuscular junction). The N-terminal portion of the 
5 heavy chain is thought to mediate channel formation while the C-terminal portion mediates 
toxin binding: .he type B neurotoxin has been reported to exist as a mixture of predominantly 
single chain with some double chain (Whelan e, ai.. .supra). The 50 kD carboxy-terminal 
portion of the heavy chain is referred to as the C fragment or the I I, domain. The present 
invention reports for the llrst time, the expression of the C fragment of C. botulinum type B 
0 toxin in heterologous hosts (<?.#., E. coli). 

I he C. botulinum type F. neurotoxin gene has been cloned and sequenced from a 
number of different strains [Poulct et ai. (1992) Biochem. Biophys. Res. Commun. 183:107; 
Whelan et ai. (1992) l£ur. J. Biochem. 204:657: and Fujii et ai. (1993) J. Gen. Microbiol. 
I30:7Q|. The nucleotide sequence of the type E toxin gene is available from the EMBL 
sequence data bank under accession numbers X62089 (strain Beluga) and X62683 (strain 
NCTC 1 1219): the nucleotide sequence of the coding region (strain Beluga) is listed in SF-Q 
10 NO:45. The amino acid sequence of the C botulinum type E neurotoxin derived from 
strain Beluga is listed in SEQ ID NO:46. The type !• neurotoxin gene is synthesized as a 
single polypeptide chain which may be converted to a double-chain form (i.e.. a heavy chain 
and a light chain) by cleavage with trypsin: unlike the type A neurotoxin, the type F 
neurotoxin exists essentially only in the single-chain form. The 50 kD carboxy-terminal 
portion of the heavy chain is referred to as the C fragment or the H r domain. The present 
invention reports for the llrst time, the expression of the C fragment off. botulinum type F, 
toxin in heterologous hosts ie.}>.. £'. coli). 

The C. botulinum type CI. D. F and G neurotoxin genes have been cloned and 
sequenced. The nucleotide and amino acid sequences of these genes and toxins are provided 
herein. The invention provides methods lor the expression of the C fragment from each of 
these toxin genes in heterologous hosts and the purification of the resulting recombinant 
proteins. 

The subject invention provides methods which allow the production of soluble ('. 
bomlinum toxin proteins in economical host cells (..*.. E coli). In addition the subject 
invention provides methods which allow the production of soluble botulinal toxin proteins in 
yeast and insect cells. Further, methods for the isolation of purified soluble C. botulinum 
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toxin proteins which are suitable for immunization of humans and other animals are provided. 
These soluble, purified preparations of C botulinum toxin proteins provide the basis for 
improved vaccine preparations and facilitate the production of antitoxin. 

When recombinant clostridial toxin proteins produced in gram-negative bacteria (<.'.#., 
£ volt) arc used as vaccines, they are purified to remove endotoxin prior to administration to 
a host animal. In order to vaccinate a host, an immunogenieally-effective amount of purified 
substantially endutoxin-free recombinant clostridial toxin protein is administered in anv of a 
number of physiologically acceptable carriers known to the art. When administered for the 
purpose of vaccination, the purified substantially endotoxin-free recombinant clostridial toxin 
protein may be used alone or in conjunction with known adjutants, including potassium alum, 
aluminum phosphate, aluminum hydroxide, Gerbu adjuvant (GmDP: C.C. Biotech Corp.), 
RIBI adjuvant (MPL; RJBI Immunochemical Research. Inc.). QS2I (Cambridge Biotech). 
The alum and aluminum-based adjutants are particularly preferred when vaccines are to be 
administered to humans: however, any adjuvant approved for use in humans mav be 
employed. The route of immunization may be nasal, oral, intramuscular, intraperitoneal or 
subcutaneous. 

The invention contemplates the use of soluble, substantially endotoxin-free 
preparations of fusion proteins comprising the C fragment of the ( '. hotulinum type A. B, C. 
D. E. F. and Ci toxin as vaccines. In one embodiment, the vaccine comprises the C fragment 
of either the C\ hotulinum type A. B, C, D, E. F. or G toxin and a poly-histidine tract (also 
called a histidine tag). In a particularly preferred embodiment, a fusion protein comprising 
the histidine tagged C fragment is expressed using the pHT series of expression vectors 
(Novagen). The pET expression system utilizes a vector containing the T7 promoter which 
encodes the fusion protein and a host cell which can be induced to express the T7 DNA 
polymerase {/.<»., a DE3 host strain). The production of C fragment fusion proteins containing 
a histidine tract is not limited to the use of a particular expression vector and host strain. 
Several commercially available expression vectors and host strains can be used to express the 
C fragment protein sequences as a fusion protein containing a histidine tract (For example, the 
pQE series (pQE-8. 12, 16, 17. 18. 30, 31. 32. 40. 41. 42. 50. 51. 52. 60 and 70) of 
expression vectors (Qiagcn) which are used with the host strains MI5fpREP41 (Qiagen) and 
SG13009[pREP4| (Qiagen) can be used to express fusion proteins containing six histidine 
residues at the amino-terminus of the fusion protein). Furthermore a number of commercially 
available expression vectors which provide a histidine tract also provide a protease cleavage 
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site between the histidine tract and the protein of interest U:^ botulinal toxin sequences). 
Cleavage of the resulting fusion protein with the appropriate protease will remove the 
histidine tag from the protein of interest {e.g.. botulinal toxin sequences) (see Example 28a. 
infra). Removal of the histidine tag may be desirable prior to administration of the 
recombinant botulinal toxin protein to a subject { e . K .. a human). 

VI. Detection Of Toxin 

The invention contemplates detecting bacterial toxin in a sample. The term "sample" 
in the present specification and claims is used in its broadest sense. On the one hand it is 
meant to include a specimen or culture. On the other hand, it is meant to include both 
biological and environmental samples. 

biological samples may be animal, including human, fluid, solid st00 |) or tissuc: 
liquid and solid food products and ingredients such as dairy items, vegetables, meat and meat 
hy-prodi.ci.s. and waste. Environmental samples include environmental material such as 
surface matter, soil, water and industrial samples, as well as samples obtained from food and 
dairy processing instruments, apparatus, equipment, disposable and non-disposable .terns. 
These examples are not to be construed as limiting the sample types applicable to the present 
invention. 

The invention contemplates detecting bacterial toxin by a competitive immunoassay 
method that utilizes recombinant toxin A and toxin B proteins, antibodies raised against 
recombinant bacterial toxin proteins. A fixed amount of the recombinant toxin proteins are 
immobilized to a solid support {c y .. a microliter plate) followed by the addition of a 
biological sample suspected of containing a bacterial toxin. The biological sample is first 
mixed with amnity-purified or PEG fractionated antibodies directed against the recombinant 
loxin protein. A reporter reagent is then added which is capable of detecting the presence of 
antibody bound to the immobilized toxin protein. The reporter substance may comprise an 
antibody with binding specificity for the antitoxin attached to a molecule which is used to 
identify ,he presence of the reporter substance. If toxin is present in the sample, this toxin 
will compete with the immobilized recombinant toxin protein for binding to the anti- 
rccombinan, antibody thereby reducing the signal obtained following ,he addition of the 
reporter reagent. A control is employed where the antibody is no, mixed with the sample. 
This gives the highest (or reference) signal. 
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The invention also contemplates detecting bacterial toxin by a "sandwich" 
immunoassay method that utilizes antibodies directed against recombinant bacterial toxin 
proteins. Affinity-purified antibodies directed against recombinant bacterial toxin proteins are 
immobilized to a solid support (e.tf.. microliter plates), biological samples suspected of 
containing bacterial toxins are then added followed by a washing step to remove substantially 
all unbound antitoxin. The biological sample is next exposed to the reporter substance, which 
binds to antitoxin and is then washed free of substantially all unbound reporter substance. 
The reporter substance may comprise an antibody with binding specificity for the antitoxin 
attached to a molecule which is used to identify the presence of the reporter substance. 
Identification of the reporter substance in the biological tissue indicates the presence of the 
bacterial toxin. 

It is also contemplated that bacterial toxin be detected by pouring liquids (t'-tf.. soups 
and other fluid foods and feeds including nutritional supplements for humans and other 
animals) over immobilized antibody which is directed against the haclerial toxin. It is 
contemplated that the immobilized antibody will be present in or on such supports as 
cartridges, columns, beads, or any other solid support medium. In one embodiment, following 
the exposure of the liquid to the immobilized antibody, unbound toxin is substantially 
removed by washing. 'The exposure of the liquid is then exposed to a reporter substance 
which detects the presence of bound toxin. In a preferred embodiment the reporter substance 
is an enzyme, fluorescent dye. or radioactive compound attached to an antibody which is 
directed against the toxin (i.e., in a "sandwich" immunoassay). It is also contemplated that 
the detection system will be developed as necessary ( t \ ( *».. the addition of enzyme substrate in 
enzyme systems: observation using fluorescent light for fluorescent dye svstems: and 
quantitation of radioactivity for radioactive systems). 

EXPERIMENTAL 

The following examples serve to illustrate certain preferred embodiments and aspects 
of the present invention and are not to be construed as limiting the scope thereof. 

In the disclosure which follows, the following abbreviations apply: °C (degrees 
Centigrade): rpm (revolutions per minute): BBS-Tween (borate buffered saline containing 
Tween): BSA (bovine serum albumin): KLISA (enzyme-linked immunosorbent assay): IT A 
(complete Freund"s adjuvant): ITA (incomplete Freuiufs adjuvant): IgCi (immunoglobulin G); 
IgY (immunoglobulin Y): IM (intramuscular): IP (intraperitoneal): IV (intravenous or 
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intravascular): SC (subcutaneous): H,0 (water): HCI (hydrochloric acid): LD I()() (lethal dose 
lor 100% of experimental animals): aa (amino acid): HPLC (high performance liquid 
chromatography): kD (kilodaltons): gm (grams): ug (micrograms): nig (milligrams): ng 
(nanograms): ul (microliters): ml (milliliters): mm (millimeters): nm (nanometers): urn 
(micrometer): M (molar): mM (millimolar): MW (molecular weight): sec (seconds): min(s) 
(minuie/minutesi: hns) (hour/hours): MgCL (magnesium chloride): NaCI (sodium chloride): 

Na.XO: (sodium carbonate): <)D. W , (optical density at 280 nm): OD (optical density at 600 

nm): PAGE (polyacrylamide gel electrophoresis): PBS Iphosphute buffered saline (150 mM 
NaCI. 10 mM sodium phosphate buffer. pH 7.2)j: PEG (polyethylene glycol): PMSF 
(phenyimethylsulfonyl fluoride): SDS (sodium dodecyl sulfate): Tris 

(tristhydroxymcthyhammomcthane): Ensure* (Ensure !<:. Ross Laboratories. Columbus Oil): 
linfamiU (l-nfamil-K. Mead Johnson): w/v (weigh, , 0 volume): v/v (volume lo volume): 
Amicon (Amicon. Inc.. Beverly. MA): Amresco (Amresco. Inc.. Solon. Oil): ATCC 
(American Type Cul.urc Collection. Rockville. MD>: BBI. (Baltimore Biologies Laboratory. 
0. division of Beco., Dickinson). Cockeysville. MD): Becton Dickinson (Becton Dickinson 
Labware. Lincoln Park. NJ): BioRad (BioRad. Richmond. CA): Biotech (C-C Biotech Corp.. 
I'oway. CA): Charles River (Charles River Laboratories. Wilmington. MA): Coealico 
(Cocalieo Biologicals Inc.. Reamstown. PA): CytRx (CytRx Corp.. Norcross. CiA): falcon 
U'.tf. Baxter Healthcare C orp.. McCiaw Park. II. and Becton Dickinson): I DA (federal food 
and Drug Administration): fisher Biotech (fisher Biotech. Springfield. NJ): GIBCO (Grand 
Island Biologic Company/URL. Grand Island. NY): Gibco-BRL (Life Technologies. Inc.. 
Gaithershurg. MD): Harlan Sprague Dawley (Harlan Sprague Dawley. Inc.. Madison. Wl): 
Mallinckrodt (a division of Baxter Healthcare Corp.. McGaw Park. IL): Millipore (Millipore 
Corp.. Marlborough. MA): New England Biolabs (New England Biolabs. Inc.. Beverly. MA): 
Novagen (Novagen. Inc.. Madison. Wl); Pharmacia (Pharmacia. Inc.. Piscataway. NJ); Oiagen 
(Qiagcn. Chatsworth. CA): Sasco (Sasco. Omaha. NE): Showdcx (Shovva Denko America/ 
Inc.. New York. NY): Sigma (Sigma Chemical Co.. St. Louis. MO): Stcrogene (Sterogene. 
Inc.. Arcadia. CA): Tech Lab (Tech Lab. Inc.. Blacksburg. VA): and Vaxcell (Vaxcell. Inc.. 
a subsidiary of C'ytRX Corp.. Norcross. (iA). 

When a recombinant protein is described in the specification it is referred to in a 
short-hand manner by the amino acids in the toxin sequence present in the recombinant 
protein rounded lo the nearest 10. for example, the recombinant protein pMIi'l 850-2360 
contains amino acids 1852 through 2362 of the C. ,/#V//e toxin B protein. The specification 
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gives detailed construction details for all recombinant proteins such that one skilled in the art 
will know precisely which amino acids are present in a given recombinant protein. 

KXAMPLE ! 

Production Of High-Titcr Antibodies To Clostridium difficile Organisms In A Hen 

Antibodies to certain pathogenic organisms have been shown to be effective in treatinu 
diseases caused by those organisms. It has not been shown whether antibodies can be raised, 
against Clostridium difficile, which would be effective in treating infection by this organism. 
Accordingly. ('. difficile was tested as immunogen for production of hen antibodies. 

lo determine the best course for raising high-tiler egg antibodies against whole C. 

organisms, different immunizing strains and different immunizing concentrations were 
examined. The example involved (a) preparation of the bacterial immunogen. 
<h) immunization, (c) purification of anti-bacterial chicken antibodies, and (d) detection of 
anti-bacterial antibodies in the purified IgY preparations. 

a) Preparation Of Bacterial Immunogen 

( '. difficile strains 43594 (serogroup A) and 43596 (serogroup C) were originally 
obtained from the ATC.W These two strains were selected because they represent two of the 
most commonl\ -occurring serogroups isolated from patients with antibiotic-associated 
pseudomembranous colitis. (Delmee et aL. J. Clin. Microbiol.. 28( 10):2210 ( 1990).| 
Additionally, both of these strains have been previously characterized with respect to their 
virulence in the Syrian hamster model for C difficile infection. [Delmec et aL. J. Med 
Microbiol.. 33:85 

I he bacterial strains were separately cultured on brain heart infusion agar for 48 hours 
at 37°C in a Gas Pack 100 Jar (BBL. Cockeysvillc. MD) equipped with a Gas Pnck Plus 
anaerobic envelope (BUI.). Tony-eight hour cultures were used because they produce better 
growth and the organisms have been found to be more cross-reactive with respect lo their 
surface antigen presentation. The greater the degree of cross-reactivity of our IgY 
preparations, the better the probability of a broad range of activity against different 
strains/serogroups. |Toma et aL, J. Clin. Microbiol.. 26(3):426 (1988).| 

1 he resulting organisms were removed from the agar surface using a sterile dacron-tip 
swab, and were suspended in a solution containing 0.4% formaldehyde in PBS. pi I 7.2. This 
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concentration of formaldehyde has heen reported as producing good results for the purpose of 
preparing whole-organism immunogen suspensions for the generation of polyclonal anti-C 
difficile antisera in rabbits. fDelmee et al.. J. Clin. Microbiol.. 21:323 (1985): Davies et al.. 
Microbial Path.. 9:141 (1990).] In this manner, two separate bacterial suspensions were 
prepared, one for each strain. The two suspensions were then incubated at 4°C for I hour, 
following this period of formalin-treatment, the suspensions were centrifuged at 4.200 x g for 
20 min.. and the resulting pellets were washed twice in normal saline. The washed pellets, 
which contained formalin-treated whole organisms, were resuspended in fresh normal saline 
such thai the visual turbidity of each suspension corresponded to a til McFariand standard. 
(M.A.C. IZdclstcin. "Pmcessinf; Clinical Specimens for Anaerahic Bacteria: Isolation and 
Identification Procedures." in S.M. Finegold el al (eds.).. Bailey and Scot, s Diagnostic 
Microbiology, pp. 477-507. C.V. Mosby Co.. (1990). The preparation of McFariand 
nephelometer standards and the corresponding approximate number of organisms for each 
it.be arc described in detail at pp. 172-173 of this volume. | Kach of the two til suspensions 
was then split into two separate volumes. One volume of each suspension was volumetric-ally 
adjusted, by the addition of saline, to correspond to the visual turbidity of a ti\ McFariand 
standard. |/,/.| The it] suspensions contained approximately 3 x 10 s organisms/ml. and the 
til suspensions contained approximately 2x10" organisms/ml. \/d.\ The four resulting 
concentration-adjusted suspensions of formalin-treated C difficile organisms were considered 
io be "bacterial immunogen suspensions." These suspensions were used immediately after 
preparation for the initial immunization. [See section (b).| 

The formalin-treatment procedure did not result in 100% non-viable bacteria in the 
immunogen suspensions. In order to increase the level of killing, the formalin concentration 
and length of treatment were both increased for subsequent immunogen preparations, as 
described below in fable 3. (Although viability was decreased with the stronger formalin 
treatment. 100% inviabilitv of the bacterial immunogen suspensions was not reached.) Also, 
in subsequent immunogen preparations, the formalin solutions were prepared in normal saline 
instead of PUS. At day 49. the day of the fifth immunization, the excess volumes of the four 
previous bacterial immunogen suspensions were stored frozen at -7()°C for use during all 
subsequent immunizations. 
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l>) Immunization 

Tor the initial immunizaiion. 1.0 ml volumes of each of the four bacterial immunogcn 
suspensions described above were separately emulsilied in 1.2 ml volumes of CFA (GIBCO). 
f or each of the lour emulsilied immunogen suspensions, two four-month old White Leghorn 
hens (prepaying) were immunized. (It is not necessary to use pre-laying hens: actively-laying 
hens can also be utilized.) Each hen received a total volume of approximately 1.0 ml of a 
single emulsilied immunogen suspension via four injections (two subcutaneous and two 
intramuscular) of approximately 250 uj per site. In this manner, a total of four different 
immunization combinations, using two hens per combination, were initiated for the purpose of 
evaluating both the effect of immunizing concentration on egg yolk antibody (IgY) 
production, and intersirain cross-reactivity of IgY raised against heterologous strains. The 
four immunization groups are summarized in Table 3. 

TABLE 3 



Group Designation 


Immunizing Strain 


Approximate Im muni /.in a Dose 


C I) 43594. «l 


( \ difficile 
strain 43594 


1.5* 10 s organisms- hen 


( f) 43594. ;.-7 




1 0 * 1(1 organisms, hen 


CD 435%. "1 


( *. difficile 
strain 435% 


1 .5 y 10 s organisms/ hen 


CD 43596. "7 




1.0 * 10' organ isnis. r hen 



I he lime point for the first series of immunizations was designated as "day zero. 1 ' All 
subsequent immunizations were performed as described above except that the bacterial 
immunouen suspensions were emulsified using II'A ((ilUCO) instead of CI A. and for the 
later lime point immunization, the stored frozen suspensions were used instead of freshly- 
prepared suspensions. The immunization schedule used is listed in Table 4. 
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TABLE 4 



Day OC Immunization 


Formalin-Treatment 


| Immunogcn Preparation Used 


0 


1%. 1 hr. 


freshly-prepared 


14 


1%. overnight 




-> i 


overniiiht 




;o 


IV 48 hrs. 




49 


1%, 72 hrs. 




70 




stored frozen 


x> 






105 







i ) Purification Of Anti-Bacterial Chicken Antibodies 

(iroups of lour eggs were collected per immunization group between days 80 and X4 
post-initial immunization, and chicken immunoglobulin (IgY) was extracted according to a 
modilleaiion of the procedure of A. Poison el a/.. Immunol. Comm.. 9:405 (1980). A gentle 
stream of distilled water from a squirt bottle was used to separate the yolks from the whites, 
and the yolks ucrc broken by dropping them through a funnel into a graduated cylinder. The 
lour indiv idual yolks were pooled for each group. The pooled, broken yolks were blended 
uilb 4 volumes of egg extraction buffer to improve antibody yield (egg extraction buffer is 
0.01 M sodium phosphate. 0.1 M Nad. pi I 7.5. containing 0.005% tbimerosal). and PlXi 
S000 (Amresco) was added to a concentration of 3.5%. When all the PlXi dissolved, the 
protein precipitates that formed were pelleted by centrifuyation at 1 3.000 x g f or |() m i nulcs . 
The supernatants were decanted and tillered through cheesecloth to remove the lipid layer, 
and the PlXi was added to the supernatants to a final concentration of 12% (the supernatants 
were assumed to contain 3.5% PEG). Alter a second ccntrilugation. the supernatants were 
discarded and the pellets were centrifuged a final time to extrude the remaining PlXi. These 
crude IgY pellets were then dissolved in the original yolk volume of egg extraction buffer and 
stored at 4°C\ As an additional control, a preimmune IgY solution was prepared as described 
above, using eggs collected from unimmunized hens. 
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d) Detection Of Anti-Bacterial Antibodies In The Purified I^Y 
Preparations 

In order to evaluate the relative levels of specific anii-C. difficile activity in the IgY 
preparations described above, a modified version of the whole-organism ELISA procedure of 
N.V. Padhye el al.. 3. Clin. Microbiol. 29:99-103 (1990) was used. Frozen organisms of both 
(\ difficile strains described above were thawed and diluted to a concentration of 
approximately 1*10' organisms/ml using PBS. pi I 7,2. In this way. two separate coating 
suspensions were prepared, one for each immunizing strain. Into the wells of %-well 
microliter plates (Falcon. Pro-Bind Assay Plates) were placed 100 l0 volumes of the coating 
suspensions. In this manner, each plate well received a total of approximately I x 10*' 
organisms of one strain or the other. The plates were then incubated at 4°C overnight. The 
next morning, the coaling suspensions were decanted, and all wells were washed three times 
using PBS. In order to block non-specific binding sites. 100 ul of 0.5% BSA (Sigma) in PBS 
was then added to each well, and the plates were incubated for 2 hours at room temperature. 
The blocking solution was decanted, and 100 j.t! volumes of the IgY preparations described 
above were initially diluted 1:500 with a solution of 0.1% BSA in PBS. and then serially 
diluted in 1:5 steps. The following dilutions were placed in the wells: 1:500. 1:2.500. 
1:62.5000. I:.! 12.500. and 1:1.562.500. The plates were again incubated for 2 hours at room 
temperature. Following this incubation, the IgY-containing solutions were decanted, and the 
wells were washed three times using BBS-Tween (0.1 M boric acid. 0.025 M sodium borate. 
1.0 M Na('l. 0.1% Tween-20). followed by two washes using PBS-Twecn (0.1% Twoen-20). 
and finally, two washes using PBS only. To each well. 100 ^1 of a 1:750 dilution of rabbit 
anti-chicken lg(i ( whole-moleculel-alkaline phosphatase conjugate (Sigma) (diluted in 0.1% 
BSA in PBS) was added. The plates were again incubated for 2 hours at room temperature. 
The conjugate solutions were decanted and the plates were washed as described above, 
substituting 50 mM Na,CO,. pi I 9.5 for the PBS in the final wash. The plates were 
developed by the addition of 100 M l of a solution containing 1 mg/ml para-nitrophenyl 
phosphate (Sigma) dissolved in 50 mM Na : CO ? . 10 mM MgCF. pH <).5 io each well, and 
incubating the plates at room temperature in the dark for 45 minutes. The absorbance of each 
well was measured at 410 nm using a Dynaiech MR 700 plate reader. In this manner, each of 
the lour IgY preparations described above was tested for reactivity against both of the 
immunizing ('. difficile strains: strain-specific, as well as cross-reactive activity was 
determined. 
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Tabic 5 shows the results of the whole-organism ELISA. All four IgY preparations 
demonstrated significant levels of activity, to a dilution of 1:62.500 or greater against both of 
ihe immunizing organism strains. Therefore, antibodies raised against one strain were highly 
cross-reactive with the other strain, and vice versa. The immunizing concentration of 
organisms did not have a significant effect on organism-specille IgY production, as both 
concentrations produced approximately equivalent responses. Therefore, the lower 
immunizing concentration of approximately 1.5 x 10 s organisms/hen is the preferred 
immunizing concentration of the two tested. The prcimmunc IgY preparation appeared to 
possess relatively low levels of C. c/,//,V//,-reactive activity ,o a dilution of 1 :50(). probably 
due to prior exposure of the animals to environmental Clostridia. 

An .ni.ial whole-organism FLISA was performed using IgY preparations made from 
single CD 4>5o 4 . U\ and CD 43506. U\ eggs collected around day 50 (data not shown,. 
Specific titers were found to be 5 to 1 0-fold lower than those reported in Table 5. These 
results demonstrate that it is possible to begin immunizing hens prior to the time that thev 
begin to lav eggs, and to obtain high titer specific IgY from the lirs. eggs that are laid. In 
other words, i, is no, necessary „> wait for the hens to begin laying before the immunization 
schedule is started. 
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TABLF. 5 



Results Of The Anti-C. difficile Whole-Organism ELiSA 



luY Preparation 


Dilution Of IgY Prep 


43594-Coatcd Wells 


43596-Coated Wells 




1 :500 


1 .746 


1.801 




1 :2.500 


1 .092 


1.670 


CD 43594. n\ 


1 : 1 2.500 




0.8 1 2 




1:62.500 


0.136 


0.1 79 




1:312,500 


0.012 


0.080 




1:1.562.500 


0.002 


0.020 




1 :500 


1 .780 


1.771 




1 :2.500 


1 .025 


1.078 


C D 43594. pi 


1 : 1 2.500 


O 1 if If 


0.382 




1:62.500 


0.052 


0.132 




1:312.500 


0.022 


0.043 




1:1.562.500 


0.005 


0.024 




1 : 500 


1 .526 


1.790 




1 .2..M/U 


0.832 


1,177 


U> 4:>5%. n 1 




w.J4 / 


0,452 




1:62.500 


0.050 


0.242 




1:312.500 


0.010 


0.067 




1:1.562.500 


0.000 


0.036 




1:500 


1.702 


1.505 




1 :2.50() 


0.706 


0.866 


v. I > 4> y)u. t: I 


1:12.500 


(1.250 


0.282 




1:62.500 


0.0 j o 


0.078 




1:312.500 


0.002 


0.017 




1:1.562.500 


0.000 


0.010 




1:500 


0.142 


0.30*) 




1:2.500 


0.032 


0.077 


Pre immune l»Y 


1:12.500 


0.006 


0.024 




1:62.500 


0.002 


0.012 




1:312.500 


0.004 


0.010 




1:1.562.500 


0.002 


0.014 



EXAMPLE 2 

Treatment Of ( '. difficile Infection With Anti-C difficile Antibody 



In order to determine whether the immune IgY antibodies raised against whole C. 
difficile organisms were capable of inhibiting the infection of hamsters by ('. difficile. 
hamsters infected by these bacteria were utilized. (Lyerly et at.. Infect. Immun.. 5 C >:221 5- 
221 X (1991 ).| This example involved; (a) determination of the lethal dose of (.*. difficile 
organisms: and (b) treatment of infected animals wiih immune antibody or control antibody in 
nutritional solution. 
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a) Determination Of The Lethal Dose Of C. difficile Organisms 
Determination of the lethal dose off. difficile organisms was carried out according to 
the model described by D.M. Lyerly ei al.. Infect. Immun.. 59:2215-2218 (1991). c ttifficile 
strain ATCC 43596 (serogroup C. ATCC) was plated on BHI agar and grown anaerobically 
(RBI. Gas Pak 100 system) at 37°C for 42 hours. Organisms were removed from the agar 
surface using a sterile dacron-tip swab and suspended in sterile 0.9% NaCl solution to a 
densiiy of 10* organisms/ml. 

In order to determine the lethal dose off. difficile in the presence of control antibody 
and nutritional formula, non-immune eggs were obtained from unimmunized hens and a 12% 
PEG reparation made as described in Example 1(c). This preparation was redissolved in one 
fourth the original yolk volume of vanilla flavor Ensure*. 

Starting on das one. groups of female Golden Syrian hamsters (Harlan Sprague 
Dawlcy). 8-9 weeks old and weighing approximately 100 gm. were oralis administered I oil 
of the preimmune/r-nsurcK- formula at time zero. 2 hours. 6 hours, and 10 hours. Al I hour, 
animals were orally administered 3.0 mg clindamycin HCI (Sigma) in I ml of water. This 
drug predisposes hamsters to C. difficile infection by altering the normal intestinal flora. On 
day .wo. the animals were given I ml of the preimmune IgY/Ensure* formula at time zero. 2 
bours. 6 hours, and 10 hours. A. I hour on day two. different groups of animals were 
inoculated orally with saline (control), or 10-'. 10\ 10". or 10* V. difficile organisms in I ml 
of saline, f rom days 3-12. animals were given I ml of the preimmune IgY/Ensure «• formula 
lha-c times daily and observed for the onset of diarrhea and death. Each animal was housed 
in an indiv idual cage and was offered food and water ad libitum. 

Administration of 10* - 10" organisms resulted in death in 3-4 days while the lower 
doses of 10- - I,)' organ.sms caused death in 5 days. Cecal swabs taken from dead animals 
indicated the presence of < : difficile. Given the effectiveness of the 10-' dose, this number of 
organisms was chosen for the following experiment to see if hyperimmune anti-C. difficile 
antibody could block infection. 

b) Treatment Of Infected Animals With Immune Antibody Or 
Control Antibody In Nutritional Formula 

The experiment in (a) was repeated using three groups of seven hamsters each. Group 
A received no.elindamycin or C. difficile and was the survival control. Group B received 
clindamycin. 10- C. difficile organisms and preimmune IgY on the same schedule as the 
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animais in (a) above. Group C received clindamycin. difficile organisms, and 

hyperimmune anti-C difficile IgY on the same schedule as (iroup B. The anti-C difficile IgY 
was prepared as described in Example 1 except that the 12% PI-G preparation was dissolved 
in one fourth the original yolk volume of Ensure*;. 

All animals were observed for the onset of diarrhea or other disease svmptoms and 
death. Each animal was housed in an individual cage and was offered food and water ad 
lihiium. The results are shown in Table 6. 



TABLE 6 

The Effect 01' Oral Feeding Of Hyperimmune leY Antibody on ( '. difficile infection 



10 



Animal Group 


Time To Diarrhea 4 


Time To Death' 1 


A pre- immune IgY only 


no diarrhea 


no deaths 


1* Clindamycin. ('. difficile, preimmune IgY 


M) hrs. 


49 hrs. 


C C lindamycin. (' difficile, immune IgY 


VI hrs. 


56 hrs, 



Mean of seven animals. 



Hamsters in the control group A did not develop diarrhea and remained healthy during 
the experimental period. Hamsters in groups B and C developed diarrheal disease. Anti-C. 
difficile ly Y did not protect the animals from diarrhea or death, all animals succumbed in the 
same lime interval as the animals treated with preimmune IgY. Thus, while immunization 
with whole organisms apparently can improve sub-lethal symptoms with particular bacteria 
(see U.S. Patent No. 5.080.895 to IT Tokoro). such an approach does not prove to be 
productive to protect against the lethal effects of ('. difficile. 



EXAMPLF 3 

Production of ( ' hotulimtm Type A Antitoxin in Hens 

In order to determine whether antibodies could be raised against the toxin produced bv 
clostridial pathogens, which would be effective in treating clostridial diseases, antitoxin to ('. 
hotulimtm type A toxin was produced. This example invokes: (a) toxin modification: (b) 
immunization: (c) antitoxin collection: (d) antigenicity assessment: and (e) assay of antitoxin 
liter. 
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a) Toxin Modification 

C hnmlimm type A toxoid was obtained from B. R. DasGupta. From this, the active 
type A neurotoxin (M.W. approximately 150 kD) was purified to greater than 99% pilrit> , 
according to published methods. [B.R. DasGupta & V. Sathyamoorthy. Toxicon. 22:415 
(1984).] The neurotoxin was detoxified with formaldehyde according to published methods. 
|U.R. Singh & B.R. DasGupta. Toxicon. 27:403 (1989).| 

•>) Immunization 

C hoiulmum toxoid lor immunization was dissolved in PBS ( 1 my/ml) and was 
emulsified with an approximately equal volume of CFA (GIBCO) lor initial immunization or 
M A |»r booster immunization. On day zero, two white leghorn hens, obtained from local 
breeders, were each injected at multiple sites (intramuscular and subcutaneous) with 1 ml 
inae.ix-a.ed toxoid emulsified in I ml CFA. Subsequent boos.er immunizations were made 
according to the following schedule for day of injection and toxoid amount: davs 14 and "U - 
0.5 mg: day 17. - 0.75 mg: days 394. 401. 409 - 0.25 mg. One hen received an additional 
booster <>T 0.150 mu on day 544. 

O Antitoxin Collection 

Total yolk immunoglobulin (IgY) was extracted as described in Fxample 1(c) and the 
IgY pellet was dissolved in the original yolk volume of PBS with thimerosal. 

d) Antigenicity Assessment 

l-ygs were collected from day 409 through day 423 to assess whether the toxoid was 
sufficiently immunogenic to raise antibody. Eggs from the two hens were pooled and 
antibody was collected as described in the standard PEG protocol. [Example 1(c).] 
Antigenicity of the botulinal toxin was assessed on Western blots. The 150 kD detoxified 
type A neurotoxin and unmodified, toxic. 300 kD botulinal type A complex (toxin used for 
intragastric route administration lor animal gut neutralization experiments: see Kxample 6) 
were separated on a SDS-polyacrylamide reducing gel. The Western blot technique was 
performed according to the method of Towbin. fll. Towbin c, aL Proc. Natl. Acad. Sci. 
l.'SA. 76:4350 (1979).] Ten ag samples ofC. W,„,„„ compk , x and (OX(m| wrc 
•n SDS reducing sample buffer ,1% SIXS. 0.5% 2-mercaptoe,hanol. 50 mM Ins. pi I 6.8. 10% 
glycerol. 0.025% w/v bromplienol blue. 10% (J-mercaptoethanol). heated at 95«C lor 10 min 
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and separated on a 1 mm thick 5% SDS-polyacrylamide gel. |K. Weber and M. 
Osborn" Proteins and Sodium Dodecy I Sulfate: Molecular Weight Determination on 
Polyaaylamide (lets and Related Procedures" in The Proteins. 3d Edition (H. Neurath & 
R-L. Mill. eds). pp. 179-223, (Academic Press. NY. 1975).) Part of the gel was cm off and 
the proteins were stained with Coomassie Blue. The proteins in the remainder of the gel were 
transferred to nitrocellulose using the Milliblot-SDE electro-blotting system (Millipore) 
according to manufacturer's directions. The nitrocellulose was temporarily stained with 10% 
Ponceau S fS.B. Carroll and A. Laughon. "Production and Purification of Polyclonal 
Antibodies to the Foreign Segment of (3-galactosidase Fusion Proteins." in DM Cloning; A 
Practical Approach. Vol.111. (D. Glover, ed.). pp. 89-1 II. IRL Press. Oxford. (1987)] to 
visualize the lanes, then destaincd by running a gentle stream of distilled water over the blot 
for several minutes. The nitrocellulose was immersed in PBS containing 3% BSA ovcrniiiht 
at 4°C to block any remaining protein binding sites. 

The blot was cut into strips and each strip was incubated with the appropriate primarv 
\> antibodv The avian anti-f. hotuluium antibodies [described in (c)| and pre-immune chicken 
antibody (as control) were diluted 1:125 in PBS containing I mg/ml USA for 2 hours at room 
temperature. The blots were washed with two changes each of large volumes of PBS. BBS- 
'I* ween and PBS. successively (10 min/wash). Goat anti-chicken IgG alkaline phosphatase 
conjugated secondary antibody (I'isher Biotech) was diluted 1:500 in PBS containing 1 mg/ml 
20 BSA and incubated with the blot for 2 hours at room temperature. The blots were washed 

with two changes each of large volumes of PBS and BBS-Tween. followed by one change of 
PBS and 0.1 M Tris-IICI. pH 9.5. Blots were developed in freshly prepared alkaline 
phosphatase substrate buffer (100 Mg/ml nitroblue tetrazolium (Sigma). 50 yvpm\ 5-bromo-4- 
chloro-3-indolyl phosphate (Sigma). 5 mM MgCU in 50 mM NaXO v pH 9.5). 
-* l he Western blots are shown in Figure I . The anti-f ' hotu/inum IgY reacted to the 

toxoid to give a broad immunoreactive band at about 145-150 kD on the reducing gel. This 
toxoid is refractive to disulfide cleavage by reducing agents due to formalin crosslinking. The 
immune IgY reacted with the active toxin complex, a 97 kD C. hotu/inum type A heavy chain 
and a 53 kD light chain. The preimmune IgY was unreactive to the ( ' boudinum complex or 
30 toxoid in the Western blot. 
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c) Antitoxin Antibody Titer 

The IgY antibody titer to ('. boiulinum type A toxoid of eggs harvested between day 
409 and 423 was evaluated by F.LISA, prepared as follows. Ninety-six-well Falcon Pro-bind 
plates were coated overnight at 4°C with 100 nl/well toxoid [B.R. Singh & B.R. Das Gupta. 
Toxicon 27:403 (I989)| at 2.5 ug/ml in PBS. pH 7.5 containing 0.005% thimerosal. The 
following day the wells were blocked with PBS containing 1% BSA for I hour at 37°C. The 
IgY from immune or preimmunc eggs was diluted in PBS containing 1% BSA and 0.05% 
Tueen 20 and the plates were incubated for I hour at 37°C. The plates were washed three 
limes with PBS containing 0.05% Tvveen 20 and three times with PBS alone. Alkaline 
phosphatasc-conjugated goat-ami-chicken IgG (Fisher Biotech) was diluted 1:750 in PBS 
containing 1% BSA and 0.05% Tween 20. added to the plates, and incubated 1 hour at 37°C. 
The plates were washed as before, and p-nitrophenyl phosphate (Sigma) at I mg/ml in 0.05 M 
Na ,CO,. pi I 9.5. |() m.V1 MgCU was added. 

The results are shown in Figure 2. Chickens immunized with the toxoid generated 
high titers of antibody to the imnumogen. Importantly, eggs from both immunized hens had 
signillcant ami-immunogen antibody liters as compared to preimmune control eggs. The anli- 
C homliimm IgY possessed significant activity, to a dilution of 1:9.1.750 or greater. 

EXAMPLE A 

Preparation Of Avian Lgg Yolk Immunoglobulin In An Orally Administrate Form 

In order to administer avian IgY antibodies orally to experimental mice, an effective 
delivery formula for the IgY had to be determined. The concern was that if the crude IgY 
was dissolved in PBS. the saline in PBS would dehydrate the mice, which might prove 
harmful over the duration of the study. Therefore, alternative methods of oral administration 
of IgY were tested. The example involved: (a) isola-tion of immune IgY: (h) solubilization 
of IgY in water or PBS. including subsequent dialysis of the IgY-PBS solution with water to 
eliminate or reduce the salts (salt and phosphate) in the buffer: and (c) comparison of the 
quantity and activity of recovered IgY by absorbance at 280 nm and PACiii. and enzyme- 
linked immunoassay (F.LISA). 
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a) Isolation Of Immune IgY 

In order to investigate the most effective delivery formula for IgY. we used IgY which 
was raised against Cmialus durissus lerrificus venom. Three eggs were collected from hens 
imnninized with the C. Jurist wrijicus venom and IgY was extracted from the yolks using 
the modified Poison procedure described by Thallcy and Carroll | Bio/Technology. 8:934-938 
(IW())| as described in Example 1(c). 

The egg yolks were separated from the whiles, pooled, and blended with four volumes 
of PBS. Powdered PEG 8000 was added to a concentration of 3.5%. The mixture was 
ecntrifuged at 10.000 rpm for 10 minutes to pellet the precipitated protein, and the 
supernatant was tillered through cheesecloth lo remove the lipid layer. Powdered PI-G 8000 
was added to the supernatant lo bring the final PFX, concentration to 12% (assuming a PEG 
concentration of 3.5% in the supernatant). The 12% PEG/lgY mixture was divided into two 
euual volumes and centrifugeel to pellet the IgY. 

I») Solubilization Of The I«Y In Water Or PBS 

One pellet was resuspended in 1/2 the original yolk volume of PBS. and the other 
pellet was resuspended in 1/2 the original yolk volume of water. The pellets were then 
ecmrifuged to remove any particles or insoluble material. The IgY in PBS solution dissolved 
readilv but the fraction resuspended in water remained cloudy. 

In order to satisfy anticipated sterility requirements lor orally administered anybodies. 
Hie antibody solution needs to be filter-sterilized (as an alternative to heat sterilization which 
would destroy the antibodies). The preparation of IgY resuspended in water was too cloudv 
lo pass through cither a 0.2 or 0.45 um membrane filter, so 10 m ] of the PBS resuspended 
fraction was dialyzed overnight at room temperature against 250 ml of water. The following 
morning the dialysis chamber was emptied and refilled with 250 ml of fresh U.O for a second 
dialysis. Thereafter, the yields of soluble antibody were determined at OD... and are 
compared in Table 7. 
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TABLE 7 



Fraction 


Absorbance Of 1:10 Dilution Al 280 nm 


Percent Recovery 


PBS dissolved 


1.149 


100% 


IU> dissolved 


0.706 


61% 


PBS dissolved H.O dialvzed 


0.H8.S 


77% 



Resuspending the pellets in PBS followed by dialysis against water recovered more 
antibody than directly resuspending the pellets in water (77% versus 61%). Equivalent 
volumes of the IgY preparation in PBS or water were compared by PAGE, and these results 
were in accordance with the absorbance values (data not shown). 



O Activity Of I«\ Prepared With Different Solvents 

An 1:1. ISA was performed to compare the binding activity of the luY extracted bv 
each procedure described above. ( ,/,„™ ,em,icus « •,/.,.) venom at 2.5 uu/ml j„ Pus 
-as used to coat each wel. of a %-well microliter plate. The remaining protein binding sues 
wore blocked with PBS containing 5 mg/mi BSA. I> r imary antibodv dilutions (in PBS 
containing I n,g,ml BSA) were added in duplicate. After 2 hours of incubation at room 
temperature, the unbound primary antibodies were removed by washing the wells with PBS 
HBS-Twccn. and PBS. The species specific secondary antibody (goat anti-chicken 
immunoglobulin alkalinc-phosphatase conjugate (Sigma, was diluted 1:750 in PBS containing 
I my/ml BSA and added to each well of the microtia plate. After 2 hours of incubation « 
-om temperature, .he unbound secondary antibody was removed by washing the plate as 
belore. and freshly prepared alkaline phosphatase substrate (Sigma) a, I mg/ml in SO mM 
Na,CO, |() mM M uCI, pi I 9.5 was added to each well. The color development was 
measured on a Dynatech MR 700 microplate reader using a 412 nm (liter. The results are 
shown in [able 8. 

The binding assay results parallel the recovery values in Table 7. with PBS-dissolved 
luY showing slightly „ K , rc actIvilv t|ian lhe PBS-dissolved/l I.O dialed antibodv The 
water-dissolved" antibody had considerably less binding activity than the other preparations 
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EXAMPLE 5 

Survival Of Antibody Activity After Passage Through The Gastrointestinal Tract 

In order to determine the feasibility of oral administration of antibody, it was of 
interest to determine whether orally administered IgY survived passage through the 
gastrointestinal tract. The example involved; (a) oral administration of specific immune 
antibody mixed with a nutritional formula: and (b) assay of antibody activity extracted fr 
feces. 



TABLE 8 

Amigen-Uindint; Activity 01' IgY Prepared With Different Solvents 



Dilution 


Preimmune 


PBS Dissolved 


1 1 O Dissolved 


PBS/I l : 0 


1:500 


0005 


1.748 


1.577 


1.742 


1:2.500 


0.004 


0.644 


0.34*) 


0.606 


1:12.500 


0.001 


0.144 


0.054 


II. (W0 


1:62.500 


0.001 


0.025 


0.007 


0.016 


1:312.500 


0.0 to 


0.000 


0.000 


0.002 



a) Oral Administration Of Antibody 

The IgY preparations used in this example are the same PBS-dissoived/l-KO dialyzed 
antivenom materials obtained in Hxample 4 above, mixed with an equal volume of I-nlamil*. 
I wo mice were used in this experiment, each receiving a different diet as follows: 

1 ) water and food as usual: 

2) immune IgY preparation diatyzed against water and mixed 1:1 with knlamil*. 
( I he mice were given the corresponding mixture as their only source of food and water). 

b) Antibody Activity After Ingestion 

After both mice had ingested their respective fluids, each tube was refilled with 
approximately 10 ml of the appropriate fluid first thing in the morning. By mid-morning 
there was about 4 to 5 mi of liquid left in each tube. At this point stool samples were 
collected from each mouse, weighed, and dissolved in approximately 500 ^tl PBS per 100 mg 
stool sample. One hundred and sixty mg of control stools (no antibody) and W mg of 
experimental stools (specific antibody) in 1.5 mi microfuge tubes were dissolved in 800 and 
500 nl PBS. respectively. The samples were heated at 37°C lor 10 minutes and vortexed 
vigorously. The experimental stools were also broken up with a narrow spatula. IZach sample 
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was centrifuged for 5 minutes in a microfugc and the supernatonts. presumably containing the 
antibody extracts, were collected. The pellets were saved at 2-8°C in case future extracts 
were needed. Because the supematants were tinted, they were diluted five-fold in l>BS 
containing 1 mg/ml BSA for the initial dilution in the enzyme immunoassay (F.LISA). The 
primary extracts were then diluted live-fold serially from this initial dilution. The volume of 
primary extract added to each well was 190 ul. The ELISA was performed exactly as 
described in Example 4. 

TABLE 9 



Dilution 


IVeimmune IgY 


Control Fecal Extract 


uiuii i run 

1:XP. Fecal Extract 


1:5 


0 


0.000 


0.032 


1:25 


0.016 


■ 0 


0.016 


1:125 


0 


• 0 


o.ooy 


1:625 


0 


0.003 


0.001 


l:.i 125 


0 


0 


0.000 



There uas some active antibody in the fecal extract from the mouse given the specific 
antibody in CnfamiLu, formula, but it was present at a very low level. Since the samples were 
assayed at an initial 1 :5 dilution, the binding observed could have been higher with less dilute 
samples. Consequently, the mice were allowed to continue ingesting either regular food and 
water or the specific IgY in Enfamil K, formula, as appropriate, so the assay could be repeated. 
Another IT. ISA plate was coated overnight with 5 ug/ml of Cel. I. venom in PBS. 

The following morning the 1-LISA plate was blocked with 5 mg/ml BSA. and the fecal 
samples were extracted as before, except that instead of healing the extracts at 37»C. the 
samples were kept on ice to limit proteolysis. The samples were assayed undiluted initially, 
and in 5X serial dilutions thereafter. Otherwise the assay was carried out as before. 
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TABLE 10 



Dilution 


Preimmune IgY 


Control Fx tract 


lixp. Extract 


undiluted 


0.003 


0 


0.379 


1:5 


• 0 


• 0 


0.071 


1:25 


0.000 


0 


0.027 


1:125 


0.003 


0 


0.017 


1:625 


0.000 


* 0 


0.008 


1:3125 


0.002 


0 


0.002 



The experiment confirmed the previous results, with the antibody activity markedly 
higher. The control fecal extract showed no anti-( '.</./. activity, even undiluted, while the 
fecal extract from the anU-C di. IgY/nnlamihsMcd mouse showed considerable anti-( \di. 
activity. This experiment (and the previous experiment) clearly demonstrate that active IgY 
antibody survives passage through the mouse digestive tract, a finding with favorable 
implications tor the success of IgY antibodies administered orally as a therapeutic or 
prophylactic. 



EXAMPLE 6 

hi Vivo Neutralization 01' Type C. hoiiilinum 
Type A Neurotoxin By Avian Antitoxin Antibody 

This example demonstrated the ability of PtiG-purilicd antitoxin, collected as 
described in Example 3. to neutralize the lethal effect of C. hntulimnn neurotoxin type A in 
mice. To determine the oral lethal dose (I.D 1(K) ) of toxin A. groups of BALB/c mice were 
given different doses of toxin per unit body weight (average body weight of 24 grams), for 
oral administration, toxin A complex, which contains the neurotoxin associated with other 
non-toxin proteins was used. This complex is markedly more toxic than purified neurotoxin 
when given by the oral route. |1. Ohishi a <//.. Infect. Immun.. 1 6: 1 Oft (1977).] ('. bomlintim 
toxin type A complex, obtained from Uric Johnson (University Of Wisconsin. Madison > was 
250 ug/ml in 50 mM sodium citrate, pi I 5.5. specific toxicity 3 x |() 7 mouse I.D 5( /mg with 
parenteral administration. Approximately 40-50 ng/gm body weight was usually fatal within 
48 hours in mice maintained on conventional food and water. When mice were tiiven a diet 
ad libitum of only Enfamil* the concentration needed to produce lethality was approximately 
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2.5 rimes higher (125 ng/gm body weight). Botulinal toxin concentrations of approximately 
200 ng/gm body weight were fatal in mice fed Enfamil® containing preimmune igY 
(rcsuspcnded in Enfamil® at the original yolk volume). 

The oral LD I(M , off. hoiuiinum toxin was also determined in mice that received 
known amounts of a mixture of preimmune IgY-Ensure® delivered orally through feeding 
needles. Using a 22 gauge feeding needle, mice were given 250 u | each of a preimmune" 
IgY-Ensure«: mixture (preimmune IgY dissolved in 1/4 original yolk volume) I hour before 
and 1/2 hour and 5 hours after administering botulinal toxin. Toxin concentrations given 
orally ranged from approximately 12 to 312 ng/gm body weight (0.3 to 7.5 ug per mouse). 
Botulinal toxin complex concentration of approximately 40 ng/gm body weight (1 M g per 
mouse) was lethal in all mice in less than 36 hours. 

Tun groups of BALB/c mice. 10 per group, were each given orally a single dose of 1 
ug each of botulinal toxin complex in 100 M | of 50 mM sodium citrate P H 5.5. The mice 
received 250 ui treatments of a mixture of either preimmune or immune IgY in Unsure i'.; ( 1/4 
original >olk volume) I hour before and 1/2 hour. 4 hours, and 8 hours alicr botulinal toxin 
administration. The mice received three treatments per day for two more days. The mice 
were observed lor % hours. The survival and mortality are shown in Table II. 

TABLE II 



Toxin Dose nii-'iim 


A n ti body Type 


Number Of Mice Alive 


Number Of Mice Dead 


•4 1 .(> 


non-immune 


0 


10 


-41.6 


ami-hotultnni toxin 


1(1 


0 



All mice treated with the preimmune IgY-Ensure* mixture died within 46 hours post- 
u.xin administration. The average time of death in the mice was 32 hours post toxm 
administration. Treatments of preimmune IgY-Ensurc® mixture did not continue beyond 24 
hours due to extensive paralysis of the mouth in mice of this group. In contrast, all ten mice 
treated with the immune anti-botulinal toxin IgY-Ensureyc mixture survived past 96 hours. 
Only 4 mice in this group exhibited symptoms of botulism toxicity (two mice about 2 days 
after and two mice 4 days after toxin administration). These .nice eventually died 5 and 6 
days later. Six of the mice in this immune group displayed no adverse effects to the toxin 
and remained alive and healthy long term. Thus, the avian anti-botulinal toxin antibodv 
demonstrated very good protection from the lethal effects of the toxin in the experimental 
mice. 
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EXAMPLE 7 

Production Of An Avian Antitoxin Against Clostridium difficile Toxin A 

Toxin A is a potent cytotoxin secreted by pathogenic strains of C. difficile, that plays a 
direct role in damaging gastrointestinal tissues. In more severe cases of ('. difficile 
intoxication, pseudomembranous colitis can develop which may be fatal. This would be 
prevented by neutralizing the effects of this toxin in the gastrointestinal tract. As a first step, 
antibodies were produced against a portion of the toxin. The example involved: (a) 
conjugation of a synthetic peptide of toxin A to bovine serum albumin: (b) immunization of 
hens with the peptide-BSA conjugate: and (c) detection of antitoxin peptide antibodies by 
lil.lSA. 



a) C onjugation Of A Synthetic Peptide Of Toxin A To Bovine 
Scrum Albumin 

15 Thc .synthetic peptide CQTIDGKKYYFN-NI U (SKQ II) N():82) was prepared 

commercially (Multiple Peptide Systems. San Diego. CA) and validated to be -80% pure by 
high-pressure liquid chromatography. The eleven amino acids following the cysteine residue 
represent a consensus sequence of a repeated amino acid sequence found in "loxin A. [Wren 
vf ui. Infect. Immun.. 59:3151-3155 (I991),| The cysteine was added to facilitate 

20 conjugation to carrier protein. 

In order to prepare the carrier for conjugation. USA (Sigma) was dissolved in 0.01 M 
NaPOj. pi I 7.0 to a final concentration of 20 mg/ml and ivnialeimidoben/.oyl-N- 
hydroxysiiceinimide ester (MBS: Pierce) was dissolved in N.N-dimethyl formamide to a 
concentration of 5 mg/ml. MBS solution. 0.51 ml. was added to 3.25 ml of the BSA solution 

2> and incubated for 30 minutes at room temperature with stirring every 5 minutes. The MBS- 
activated BSA was then purified by chromatography on a Bio-Ciel P-10 column (Bio-Rad: 40 
ml bed volume) equilibrated with 50 mM NaP0 4 . pi I 7.0 buffer. Peak fractions were pooled 
(6.0 ml). 

Lyophili/ed toxin A peptide (20 mg) was added to the activated BSA mixture, stirred 
30 until the peptide dissolved and incubated 3 hours at room temperature. Within 20 minutes, 
the reaction mixture became cloudy and precipitates formed. After 3 hours, the reaction 
mixture was centrifuged at 10.000 x g for 10 min and the supernatant analyzed for protein 
content. No significant protein could be detected at 280 nm. The conjugate precipitate was 
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washed three times with PBS and stored a. 4°C. A second conjugation was performed with 
15 mg of activated BSA and 5 mg of peptide and the conjugates pooled and suspended at a 
peptide concentration of 10 mg/ml in 10 mM Nal»0 4 . pH 7.2. 

») Immunization Of Hens With Peptide Conjugate 

Two hens were each initially immunized on day zero by injection into iwo 
subcutaneous and two intramuscular sites with 1 mg of peptide conjugate that was emulsified 
m CFA (GIBCO). The hens were boosted on day 14 and day 21 with I mg of peptide 
conjugate emulsified in IFA (GIBCO). 

c) Detection Of Antitoxin Peptide Antibodies By ELISA 

IgY was purified from two eggs obtained before immunization (pre-immune) and two 
oggs obtained 31 and 32 days after .he initial immunization using PEG fractionation as 
described in lixamplc I. 

Wells of a %-vvcll microtiter plate (falcon Pro-Bind Assay Plate) were coated 
overnight a. 4°C with 100 ug/ml solution of the toxin A synthetic peptide in PBS. pH 7.2 
prepared by dissolving 1 mg of the peptide in 1.0 ml of H.O and dilution of PBS. The pre- 
immune and immune IgY preparations were diluted in a live-fold series in a buffer containing 
1% Pitt 8000 and 0..% Tween-2<) (v/v) in PBS. pH 7.2. The wells were blocked for 2 
hours at room temperature with .50 M | of a solution containing 5% (v/v) Carnation*, nonfat 

'"ilk and 1% PEG 8000 in PBS. P H 7.2. After incubation for 2 hours at room 
temperature, the wells were washed, secondary rabbi, anti-chicken IgG-alkaline phosphatase 
I l:7M») added. ,he wells washed again and the color development obtained as described in 
hxumple I. The results are shown in Table 12. 



Keactiviu 

Dilution Of PHG Prep 


TABLE 12 

' Of IgY Willi Toxin Peptide 

Absorbance At 410 nm 


Preimmune 


Immune Anti-Peptide 


1:100 


0.013 


0.253 


1:500 


0.004 


(1.030 


1:2500 


0.004 


0.005 



30 



Clearly, the immune antibodies contain titers against this repeated 



epitope of toxin A. 
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EXAMPLE 8 

Production Of Avian Antitoxins Against 
Clostridium difficile Native Toxins A And B 

To determine whether avian antibodies are effective tor the neutralization of C. 
difficile toxins, hens were immunized using native difficile toxins A and B. The resultinu 
cut: yolk antibodies were then extracted and assessed for their ability to neutralize toxins A 
and B in vitro. The Example involved (a) preparation of the toxin immunogens, fb) 
immunization, (e) purification of the antitoxins, and (d) assay of toxin neutralization activity. 

a) Preparation Of The Toxin Immunogens 

Botii C. difficile native toxins A and B. and ('. difficile toxoids, prepared by the 
treatment of the native toxins with formaldehyde, were employed as immunogens. ( '. difficile 
toxoids A and B were prepared by a procedure which was modified from published methods 
(Lhrieh el at.. Infect, immun. 28:1041 (1980). Separate solutions (in PBS) of native (' 
difficile toxin A and toxin B (Tech Lab) were each adjusted to a concentration of 0.20 mg/ml. 
and formaldehyde was added to a final concentration of 0.4%. The toxin/formaldehyde 
solutions were then incubated at 37°C for 40 hrs. i ; rec formaldehyde was then removed from 
the resulting toxoid solutions by dialysis against PBS at 4°C\ In previously published reports, 
this dialysis step was not performed. Therefore, free formaldehyde must have been present in 
their toxoid preparations. The toxoid solutions were concentrated, using a Centriprep 
concentrator unit (Amicon). to a final toxoid concentration of 4.0 mg/ml. The two resulting 
preparations were designated as toxoid A and toxoid B. 

C difficile native toxins were prepared by concentrating stock solutions of toxin A and 
toxin B (Tech Lab. lnc). using Centriprep concentrator units (Amicon). lo a final 
concentration of 4.0 mg/ml. 

b) Immunization 

fhc first two immunizations were performed using the toxoid A and toxoid B 
immunogens described above. A total of 3 different immunization combinations were 
employed, for the first immunization group. 0.2 ml of toxoid A was emulsified in an equal 
volume of Tiler Max adjuvant (CytRx). T iter Max was used in order to conserve the amount 
of immunogen used, and to simplify the immunization procedure. This immunization group 
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was designated "CTA." For the second immunization group. 0.1 ml of toxoid B was 
emulsified in an equal volume of Titer Max adjuvant. This group was designated "CTB." 
For the third immunization group. 0.2 ml of toxoid A was first mixed with 0.2 ml of toxoid 
B. and the resulting mixture was emulsified in 0.4 ml of Titer Max adjuvant. This group was 
designated "CTAB." In this way. three separate immunogen emulsions were prepared, with 
each emulsion containing a final concentration of 2.0 mg/ml of toxoid A (CTA) or toxoid B 
(CTB) or si mixture of 2.0 mg/ml toxoid A and 2.0 mg/ml toxoid B (CTAB). 

On day 0. White Leghorn hens, obtained from a local breeder, were immunized as 
follows: Group CTA. Tour hens were immunized, with each hen receiving 200ug of toxoid 
A. via two intramuscular (I.M.) injections of 50^1 of CTA emulsion in the breast area. 
Group CTB. One hen was immunized with 200ug of toxoid B. via two I.M. injections of 
50ul of CTB emulsion in the breast area. Group CTAB. four hens were immunized, with 
each hen receiving a m j xturc containing 200ug of toxoid A and 200ug of toxoid B. via two 
I.M. injections of 1()() M | of CTAB emulsion in the breast area. The second immunization was 
performed 5 weeks later, on day 35. exactly as described for the first immunization above. 

In order to determine whether hens previously immunized with C. difficile toxoids 
could tolerate subsequent booster immunizations using native toxins, a single hen from group 
CTAB was immunized lor a third time, this lime using a mixture of the native toxin A and 
native toxin B described in section <a) above (these toxins were not formaldehyde-treated, and 
were used in .heir active form). This was done in order to increase the amount (titer) and 
affinity of specific antitoxin antibody produced by the hen over that achieved by immunizing 
with toxoids only. On day o2. 0. 1 ml of a toxin mixture was prepared which contained 
200ug ol- native toxin A and 200ug of native toxin B. This toxin mixture was then 
emulsified in 0.1 ml ofTitcr Max adjuvant. A single CTAB hen was then immunized with 
the resulting immunogen emulsion, v ia two I.M. injections of 100 M 1 each, into the breast area. 
This hen was marked with a wing band, and observed for adverse effects for a period of 
approximately I week, after winch lime the hen appeared to be in good health. 

Because the CTAB hen described above tolerated the booster immunization with native 
toxins A and B with no adverse effects, it was decided to boost the remaining hens with 
native toxin as well. On day 70. booster immunizations were performed as follows: Group 
CTA. A 0.2 ml volume of the 4 mg/ml native toxin A solution was emulsified in an equal 
volume of Titer Max adjuvant. Each of the 4 hens was then immunized with 2()0ug of native 
toxin A. as described for the toxoid A immunizations above. Group CTB. A 50ul volume 
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of the 4 nig/ml native toxin B solution was emulsified in an equal volume of Titer Max 
adjuvant. The hen was then immunized with 200ug of native toxin B. as described for the 
toxoid B immunizations above. Group CTAB. A 0.15 ml volume of the 4 mg/ml native 
toxin A solution was first mixed with a 0.15 ml volume the 4 mg/ml native toxin B solution. 
The resulting toxin mixture was then emulsified in 0.3 ml of Titer Max adjuvant. The 3 
remaining hens (the hen with the wing band was not immunized this time) were then 
immunized with 200ug of native toxin A and 200ug of native toxin B as described for the 
toxoid A-i toxoid B immunizations (CTAB) above. On day 85. all hens received a second 
booster immunization using native toxins, done exactly as described for the llrst boost with 
native toxins above. 

All hens tolerated both booster immunizations with native toxins with no adverse 
effects. As previous literature references describe the use of formaldehyde-treated toxoids, 
this is apparently the first time thai any immunizations have been performed using native ('. 
ilifjkilc toxins. 

c) Purification Of Antitoxins 

l-ggs were collected from the hen in group CTB 10-12 days following the second 
immunization with toxoid (day 35 immunization described in section (b) above), and from the 
hens in groups CI A and CTAB 20-21 days following the second immunization with toxoid, 
lo be used as a prc-immune (negative) control, eggs were also collected from unimmunized 
hens from the same flock. Hgg yolk immunoglobulin (IgY) was extracted from the 4 groups 
of eggs as described in Example I (c). and the final IgY pellets were solubilized in the 
original yolk volume of PBS without thiinerosal. Importantly, thimcrosal was excluded 
because it would have been toxic lo the CHO cells used in the toxin neutralization assays 
described in section (d) below. 

d) Assay Of Toxin Neutralization Activity 

The toxin neutralization activity of the IgY solutions prepared in section (c) abov 
determined using an assay system that was modified from published methods. | F.hrich ci ul. 
Infect. Immun. 28:1041-1047) (1W2): and McCiec ci ul. Micron. Path. 12:333-341 (|<W2).| 
As additional controls, affinity-purified goat anti-C. difficile toxin A (Tech I. ah) and affinity 
purified goat anti-C. itifficile toxin B (Tech Lab) were also assayed for toxin neutralization 
activity. 



e was 
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Hie IgY solutions and goat antibodies were serially diluted using F 12 medium 
(G1BCO) which was supplemented with 2% FCS (GIBCO)(this solution will be referred to as 
• medium" lor the remainder of this Example). The resulting antibody solutions were then 
mixed with a standardized eoncentration of either native ( '. difficile toxin A (Tech Lab), or 
native C. difficile toxin B (Tech Lab), at the concentrations indicated below. Following 
incubation at .!7°C lor 60 min.. lOOul volumes of the toxin ► antibody mixtures were added 
to the wells of %-wcll microliter plates (Falcon Microtest III) which contained 2.5 x I0 J 
Chinese Hamster Ovary (CHO) cells per well (the CI 10 cells were plated on the previous day 
io allow them to adhere to the plate wells). The final concentration of toxin, or dilution of 
antibody indicated below refers to the final test concentration of each reagent present in the 
respective microliter plate wells. Toxin reference wells were prepared which contained CHO 
cells and toxin A or toxin B at the same concentration used for the toxin plus antibody 
mixtures (these wells contained no antibody). Separate control wells were also prepared 
which contained CHO ceils and medium only. The assay plates were then incubated lor 18- 
24 his. in a 17°C. humidified. 5% CO, incubator. On the following day. the remaining 
adherent (viable) cells in the plate wells were stained using 0.2% crystal violet (Mallinckrodt) 
dissolved in 2% ethanol. for 10 min. F.xcess stain was then removed by rinsing with water, 
and the stained cells were solubili/.ed by adding lOOul of 1% SDS (dissolved in water) to 
each well. The absorbance of each well was then measured at 570 nm. and the percent 
cytotoxicity of each test sample or mixture was calculated using the following formula: 



% CHO Cell Cytotoxicity = |1 - < * bs - Sam P le )] X 100 

Abs. Control 



Unlike previous reports which quanlitate results visually by counting cell rounding by 
microscopy, this Example utilized spectrophotometry methods io quantise the ( '. difficile 
toxin bioassay. In order to determine the toxin A neutralizing activity of the CTA. (.TAB. 
and pre-immune IgY preparations, as well as the affinity-purified goal antitoxin A control, 
dilutions of these antibodies were reacted against a 0.1 Mg /ml concentration of native toxin A 
(this is the approx. 50% cytotoxic dose of toxin A in this assay system). The results are 
shown in Figure .1. 

Complete neutralization of toxin A occurred with the CTA IgY (antitoxin A. above) a. 
dilutions of 1:80 and lower, while significant neutralization occurred out to the 1:320 dilution. 
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The CTAB IgY (antitoxin A + toxin B. above) demonstrated complete neutralization at the 
1:320-1:160 and lower dilutions, and significant neutralization occurred out to the 1:1280 
dilution. The commercially available affinity-purified goal antitoxin A did not completely 
neutralize toxin A at any of the dilutions tested, but demonstrated significant neutralization 
out to a dilution of 1:1.280. The preimmune IgY did not show any toxin A neutralizing 
activity at any of the concentrations tested. These results demonstrate that IgY purified from 
eggs laid by hens immunized with toxin A alone, or simultaneously with toxin A and toxin B. 
is an effective toxin A antitoxin. 

The toxin B neutralizing activity of the CTAB andpre-immunc IgY preparations, and 
also the affinity-purified goat antitoxin B control was determined by reacting dilutions of 
these antibodies against a concentration of native toxin B of 0.1 ng/ml (approximately the 
50% cytotoxic dose of toxin B in the assay system). The results are shown in figure 4. 

Complete neutralization of toxin B occurred with the CTAB IgY (antitoxin A - toxin 
B. above) at the. 1:40 and lower dilutions, and significant neutralization occurred out to the 
1:320 dilution. The affinity-purified goal antitoxin B demonstrated complete neutralization at 
dilutions of 1:640 and lower, and significant neutralization occurred out to a dilution of 
1:2.560. The preimmune IgY did not show any toxin B neutralizing activity at any of the 
concentrations tested. These results demonstrate that IgY purified from eggs laid by hens 
immunized simultaneously with toxin A and toxin B is an effective toxin B antitoxin. 

In. a separate study, the toxin B neutralizing activity of CTB. CTAB. and pre-immune 
IgY preparations was determined by reacting dilutions of these antibodies against a native 
toxin B concentration of O.Iug/ml (approximately 100% cytotoxic dose of toxin B in this 
assay system). The results are shown in Figure 5. 

Significant neutralization of toxin B occurred with the CTB IgY (antitoxin B. above) 
at dilutions of 1:80 and lower, while the CTAB IgY (antitoxin A * toxin B. above) was found 
to have significant neutralizing activity at dilutions of 1:40 and lower. The preimmune IgY 
did not show any toxin B neutralizing activity at any of the concentrations tested. These 
results demonstrate that IgY purified from eggs laid by hens immunized with toxin B alone, 
or simultaneously with toxin A and toxin B. is an effective toxin B antitoxin. 
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EXAMPLE 9 

In vivo Protection Of Golden Syrian Hamsters From 
C. difficile Disease By Avian Antitoxins Against C. difficile Toxins A And B 

The most extensively used animal model to study ( ■. difficile disease is the hamster 
ILyerly ,/ a/.. Infect. Immun. 47:349-352 (1992).] Several other animal models for 
antibiotic-induced diarrhea exist, but none mimic the human form of the disease as closclv as 
■he hamster model. |R. Fckety. "Animal Models of Antihiouc-lnduced Colins." in O. Zak and 
M. Sande (eds.). Experimental Models in Antimicrobial Chemotherapy. Vol. 2. pp.61 -77 
( I 'WO. | l„ ,his m odel. the animals are first predisposed to the disease bv the oral 
administration of an antibiotic, such as clindamycin, which alters the population of normallv- 
oecurnng gastrointestinal Hora (Fekety. a. 61-72). Following the oral challenge of these 
ammals with viable C. difficile organisms, the hamsters develop cectis. and hemorrhauc. 
ulceration, and inllammation are evident in the intestinal mucosa. [I.verlv e, al Infect 
l"-un. 47:349-352 ( 1985). | The animals become .ethargie. develop severe diarrhea, and a 
h-gh percentage of them die from the disease. (Lyerly et al. Infect. Immun. 47:349-352 

This model is therefore ideally suited for the evaluation of therapeutic agents 
designed for the treatment or prophylaxis of ( '. difficile disease. 

The ability of the avian C. difficile antitoxins, described in lixample I above to 
protect hamsters from C. difficile disease was evaluated using the Golden Svrian hamster 
model of ( '. difficile infection. The lixample involved <a, preparation of the avian ( difficile 
anntuxins. ,b, ,„ ,,V„ protection of hamsters from < difficile disease by treatment with avian 
antitoxin,, and (c) long-term survival of treated hamsters. 

ii) Preparation Of The Avian C. difficile Antitoxins 

Fggs were collected from hens in groups CTA and CTAB described in Example 1 (b) 
above. To be used as a pre-immune (negative) control, eggs were also purchased from a local 
supermarket. Egg yolk immunoglobulin (IgY) was extracted from the 3 groups of c,,s as 
Ascribed in Example I ,c). and the final IgY pellets were solubilized in one fourth the 
original > oik volume of Ensure k nutritional formula. 
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b) /// vivo Protection Of Hamsters Against C difficile Disease By 
Treatment With Avian Antitoxins 

The avian ('. difficile antitoxins prepared in section (a) above were evaluated for their 
ability to protect hamsters from ('. difficile disease using an animal model system which was 
5 modified from published procedures. jFckcty. at 61-72: Borriclio el at.. J. Med. Microbiol.. 
24:53-64 (1087); Kim ei aL. Infect. Immun.. 55:2984-2992 (1987): Borriello ei aL J, Med. 
Microbiol.. 25:191-!% (1988): Delmee and Avesani. J. Med. Microbiol.. 33:85-90 (1990): 
and Lyerly ef aL. Infect. Immun.. 59:2215-2218 (I991).| l or the study, three separate 
experimental groups were used, with each group consisting of 7 female Golden Syrian 

10 hamsters (Charles River), approximately 10 weeks old and weighing approximately 100 gms. 
each. The three groups were designated "CTA." TTAB" and Tre-immune." These 
designations corresponded to the antitoxin preparations with which the animals in each group 
were treated. ITach animal was housed in an individual cage, and was offered food and water 
ad libitum through the entire length of the study. On day I. each animal was orally 

b administered 1.0 ml of one of the three antitoxin preparations (prepared in section (a) above) 
at the following limcpoints: 0 hrs.. 4 hrs.. and 8 hrs. On day 2. the day 1 treatment was 
repeated. On day 3. at the 0 hr. timepoint. each animal was again administered antitoxin, as 
described above. At I hr.. each animal was orally administered 3.0 mg of clindamvcm-NCI 
(Sigma) in 1 ml of water. This treatment predisposed the animals to infection with C. 

20 difficile. As a control lor possible endogenous C. difficile colonization, an additional animal 
I mm the same shipment (untreated) was also administered 3.0 mg of clindamycin-HCl in the 
same manner. This clindamycin control animal was left untreated (and uninfected) for the 
remainder of the study. At the 4 hr. and 8 hr. timepoinis, the animals were administered 
antitoxin as described above. On day 4. at the 0 hr. timepoint. each animal was again 

25 administered antitoxin as described above. At 1 hr.. each animal was orally challenged with I 
ml of ('. difficile inoculum, which contained approx. 100 ( difficile strain 43596 organisms 
in sterile saline. ( '. difficile strain 43596. which is a serogroup C strain, was chosen because 
it is representative of one of the most frequently-occurring serogroups isolated from patients 
with antibiotic-associated pseudomembranous colitis. I Delmee ei a/.. J. Clin. Microbiol.. 

30 28:2210-2214 ( 1985). | In addition, this strain has been previously demonstrated to be virulent 
in the hamster model of infection. | Delmee and Avesani. J. Med. Microbiol.. 33:85-90 
(1990). | At the 4 hr. and 8 hr. timepoinis. the animals were administered antitoxin as 
described above. On days 5 through 13. the animals were administered antitoxin 3x per day 
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as described for day I above, and observed for the onset of diarrhea and death. On the 
morning of day 14. the final results of the study were tabulated. These results are shown in 
Table 13. 

Representative animals from those that died in the Pre-lmmune and (TA groups were 
nccropsied. Viable C. difficile organisms were cultured from the ceca of these animals, and 
•he gross pathology of the gastrointestinal tracts of these animals was consistent with that 
expected for C. difficile disease (inflamed, distended, hemorrhagic cecum, filled with watcrv 
diarrhea-like material). In addition, the clindamycin control animal remained healthv 
throughout the entire study period, therefore indicating that the hamsters used in the'studv had 
not previously been colonized with endogenous C difficile organisms prior to the star, of the 
study. |:„|| mvine the final antitoxin treatment on day 13. a single surviving animal from the 
(TA group, and also from the (.TAB group, was sacrificed and nccropsied. No pathology 
was noted in either animal. 



TABLE 13 

Treatment Results 



Treat men t Group 



Pre-lmmune 



("TA (Antitoxin A onlv) 



No. Animals Surviving 



No. Animals Dead 



CTAI3 (Amiioxin A * Antitoxin 13) 



Treatment of hamsters with orally-administered toxin A and toxin 13 antitoxin (group 
CTAIi) successfully protected 7 out of 7 (100%) of the animals from ( : difficile disease. 
Treatment of hamsters with orally-administered toxin A antitoxin (group (TA) protected 5 
out of 7 (71%) of these animals from ( : difficile disease. Treatment usmy pre-immune IgY 
was not protective against C. difficile disease, as only I out of 7 (14%) of these an.mals " 
survived. These results demonstrate that the avian toxin A antitoxin and the avian toxin A - 
u.xin \\ antitoxin effectively protected the hamsters from C. difficile disease. These results 
also suggest that although the neutralization of toxin A alone confers some deuree of 
protection against C. difficile disease, in order to achieve maximal protection, simultaneous 
antitoxin A and antitoxin B activity is necessarv. 



<••) Long-Term Survival Of Treated Hamsters 

It has been previously reported in the literature that hamsters treated with orally- 
administered bov ine antitoxin IgG concentrate are protected from C. difficile disease as long 
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as the treatment is continued, but when the treatment is stopped, the animals develop diarrhea 
and subsequently die within 72 hrs. [Lyerly el at.. Infect. Immun.. 59(6):22 1 5-221 8 (1991).] 

In order to determine whether treatment of T. difficile disease using avian antitoxins 
promotes long-term survival following the discontinuation of treatment, the 4 surviving 
animals in group CTA. and the 6 surviving animals in group CTAB were observed for a 
period of 1 1 days (264 hrs.) following the discontinuation of antitoxin treatment described in 
section (b) above. All hamsters remained healthy through the entire posi-ireatment period. 
Ihis result demonstrates that not only does treatment with avian antitoxin protect auainst the 
onset of C. difficile disease (i.e.. it is effective as a prophylactic), it also promotes long-term 
survival beyond the treatment period, and thus provides a lasting cure. 

EXAMPLE 10 

In vivo Treatment Of Established difficile Infection In Golden 
Syrian Hamsters With Avian Antitoxins Against (\ difficile Toxins A And B 

The ability of the avian ( \ difficile antitoxins, described in Example 8 above, to treat 
an established ('. difficile infection was evaluated using the (iolden Syrian hamster model. 
The Example involved (a) preparation of the avian C. difficile antitoxins, (b) in vivo treatment 
of hamsters with established ( ' difficile infection, and (c) histologic evaluation of cecal tissue. 

a) Preparation Of The Avian C. difficile Antitoxins 

Eggs were collected from hens in group CTAB described in Example X (b) above, 
which were immunized with ( difficile toxoids and native toxins A and B. Eggs purchased 
from a local supermarket were used as a pre-immune (negative) control. Egg yolk 
immunoglobulin (IgY) was extracted from the 2 groups of eggs as described in Example 1 
(c), and the final IgY pellets were solubilizcd in one-fourth the original yolk volume of 
Ensure 1 " nutritional formula. 

b) In vivo Treatment Of Hamsters With Established C. difficile 
Infection 

I he avian (*. difficile antitoxins prepared in section (a) above were evaluated for the 
ability to treat established ('. difficile infection in hamsters using an animal model system 
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which was modified from the procedure which was described for the hamster protection study 
in Example 8(b) above. 

For the study, four separate experimental groups were used, with each group consisting 
of 7 female Golden Syrian hamsters (Charles River), approx. 10 weeks old. weighing 
approximately 100 gms. each. Each animal was housed separately, and was offered food and 
water ad libitum through the entire length of the study. 

On day I of the study, the animals in all four groups were each predisposed to C. 
difficile infection by the oral administration of 3.0 mg of clindamycin-HCI (Sigma) in 1 ml of 
water. 

On day 2. each animal in all four groups was orally challenged with 1 ml of 
C difficile inoculum, which contained approximately 100 C. difficile strain 43596 organisms 
in sterile saline. ( : difficile strain 43596 was chosen because it is representative of one of the 
most frequently-occurring serogroups isolated from patients with antibiotic-associated 
pseudomembranous colitis. [Delmee et <//.. .1. Clin. Microbiol.. 28:2210-2214 (|99()).] | n 
addition, as this was the same C. difficile strain used in all of the previous Examples above, it 
was again used in order to provide experimental continuity. 

On day 3 of the study (24 hrs. post-infection), treatment was started lor two of the 
four groups of animals. Each animal of one group was orally administered 1 .0 ml of the 
CTAB IgY preparation (prepared in section (a) above) at the following timepoints: 0 hrs.. 4 
hrs.. and 8 hrs. The animals in this group were designated "CTAB-24." The animals in the 
second group were each orally administered 1.0 ml of the pre-immune IgY preparation (also 
prepared in section (a) above) a. the same timepoints as for the (TAB group. These animals 
were designated "Pro-24." Nothing was done to the remaining two groups of animals on dav 

y. 

On day 4. 48 hrs. post-infection, the treatment described for day 3 above was repealed 
lor the CTAB-24 and Pre-24 groups, and was initiated for the remaining two groups at the 
same timepoints. The final two groups of animals were designated "CTAB-48" and Tre-48" 
respectively. 

On days 5 through 9. the animals in all four groups were administered antitoxin or 
pre-immune IgY. 3x per day. as described for day 4 above. The four experimental groups are 
summarized in Table 14. 
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TABLE 14 



Experimental Treatment Groups 



Group Dcsignaiiun 


Experimental Treatment 


CTAB-24 


Infected, treatment w/antitoxin |j«y started r«> 24 his. post- infect ion. 


Pre-24 


Infected, treatment w/pre-immune IgY started (a) 24 hrs. post- in feet ion. 


CTAB-48 


Infected, treatment w/antitoxin lyY started (w 48 hrs. post-infection. 


Pre-48 


Infected, treatment w/prc-immune IgY started (ti> 48 hrs. posi-infection. 


All animals were observed for the onset of diarrhea and death through the conclusion 
of the study on the morning of day 10. The results of this study are displayed in Table 15. 

TABLE 15 

Experimental Outcome-Dav 10 


1 rent men t Group 


No. Animals Surviving 


No. Animals Dead 


CTAH-24 


6 


1 


Pre-24 


0 


7 


CTAB-48 


4 


.i 


Pre-48 


2 





Eighty-six percent of the animals which began receiving treatment with antitoxin IgY 
at 24 hrs. post-infection (CTAB-24 above) survived, while 57% of the animals treated with 
antitoxin IgY starting 48 hrs. post-infection (CTAB-48 above) survived. In contrast, none of 
the animals receiving pre-immunc IgY starting 24 hrs. post-infection (Pre-24 above) survived, 
and only 20% of the animals which began receiving treatment with pre-immunc IgY at 48 
hrs, post-infection (Pre-48 above) survived through the conclusion of the study. These results 
demonstrate that avian antitoxins raised against C. difficile toxins A and 13 are capable of 
successfully treating established C. difficile infections in vivo. 



c) Histologic Evaluation Of Cecal Tissue 

In order to further evaluate the ability of the IgY preparations tested in this study to 
treat established ('. difficile infection, histologic evaluations were performed on cecal tissue 
specimens obtained from representative animals from the study described in section (b) above 

Immediately following death, cecal tissue specimens were removed from animals 
which died in the Pre-24 and Pre-48 groups, following the completion of the study, a 
representative surviving animal was sacrificed and cecal tissue specimens were removed from 
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the CTAB-24 and CTAB-48 groups. A single untreated animal from the same shipment as 
those used in the study was also sacrificed and a cecal tissue specimen was removed as a 
normal control, All tissue specimens were fixed overnight at 4°C in 10% buffered formalin 
The fixed tissues were paraffin-embedded, sectioned, and mounted on glass microscope slides. 
I he tissue sections were then stained using hematoxylin and eosin (II and K stain), and were 
examined by light microscopy. 

Mpon examination, the tissues obtained from the CTAB-24 and CTAB-48 animals 
showed no pathology, and were indistinguishable from the normal control. This observation 
provides further evidence for the ability of avian antitoxins raised against C difficile toxins A 
and B ,„ effectively treat established C. difficile infection, and to prevent the pathologic- 
consequences which normally occur as a result of C. tlifficUe disease. 

In contrast, characteristic substantial mucosal damage and destruction was observed in 
•I- tissues of the animals from the Pre-24 and Pre-48 groups which died Iron, ( ■. difficile 
disease. Normal tissue architecture was obliterated in these two preparations, as most of the 
mucosal layer was observed to have sloughed away, and there were numerous lar.e 
hemorrhagic areas containing massive numbers of erythrocytes. 

EXAMPLE 1 1 

Cloning And Expression Of C difficile Toxin A Fragments 

The toxin A gene has been cloned and sequenced, and shown ,o encode a protein of 
predated MW of .108 kd. fDove e, „/.. [nfect. Immun.. 58:480-488 (| W0 ).| (liven the 
expense and dif ficulty of isolating name toxin A protein, i, would be advantageous to use 
simple and mexpensive procaryotic expression systems to produce and purify hiuh levels of 
recombman, toxin A protein for immunization purposes. Ideally, the isolated recombinant 
nrotun would be soluble in order to preserve native antigenicity, since so.ubi.i 2 ed inclusion 
body proteins often do not fold into native conformations. To allow ease of purification the 
recombman. protein should be expressed to levels greater than 1 ntg/li,er of E co,i culture 
lo determine whether high levels of recombinant toxin A protein can be produced in 
col, fragments of the toxin A gene were cloned into various prokarvotic expression 
vectors, and assessed for the ability to express recombinant toxin A protein in £ ,„// Thrce 
prokaryonc expression systems were utilized. These systems were chosen because thev drive 
expre.ss.on of either fusion (pMALc and pGEX2T, or native (pLT23a-c, pmcm t0 |eve|s 
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in E. volL and allow affinity purification of the expressed protein on a ligand containing 
column. Fusion proteins expressed from pGEX vectors bind glutathione agarose beads, and 
are eluted with reduced glutathione. pMAL fusion proteins bind amylose resin, and are eluted 
with maltose. A poly-histidine tag is present at either the N-terminal (pETI6b) or C-tcrminal 
(pET23a-c) end of pET fusion proteins. This sequence specifically binds Nu chelate columns, 
and is eluted with imidazole salts. Intensive descriptions of these vectors are available 
[Williams et uL (1995) UNA Cloning 2: Expression Systems. Glover and I lames, eds. IRL 
Press. Oxford, pp. 15-58J. and will not be discussed in detail here. The Example involved (a) 
cloning of the toxin A gene, (b) expression of large fragments of toxin A in various 
prokaryotic expression systems, (c) identification of smaller toxin A gene fragments that 
express efficiently in colL (A) purification of recombinant toxin A protein by affinity 
chromatography, and (e) demonstration of functional activity of a recombinant fragment of the 
toxin A gene. 

a) Cloning Of The Toxin A Gene 

A restriction map of the toxin A gene is shown in Figure 6. The encoded protein 
contains a carboxy terminal ligand binding region, containing multiple repeats of a 
carbohydrate binding domain, (von liichel-Streiber and Sauerborn. Gene 96:107-1 13 (1990).] 
The toxin A gene was cloned in three pieces, by using either the polymerase chain reaction 
(PCR) to amplify specific regions, (regions I and 2. figure 6) or by screening a constructed 
genomic library for a specific toxin A gene fragment (region 3. Figure (>). The sequences of 
the utilized PCR primers arc PI: 5* GGAAATT TAGCTGCAGCATCTGAC 3* (SHQ ID 
N().:h: P2: 5* TCTAGCAAATTCGCTTGT GTTGAA .V <SHQ ID N().:2): P3: 5" 
CTCGCATATAGC'ATTAGACC 3' (SEQ ID NO.:3): and P4: 5* 
C'f ATCTAGGCCTAAAGTAT 3" (SEQ ID NO.:4). These regions were cloned into 
prokaryotic expression vectors that express either fusion (pMAI.c and pGEX2T) or native 
(pHT23a-c) protein to high levels in E. coli. and allow affinity purification of the expressed 
protein on a ligand containing column. 

Clostridium difficile VPI strain 10463 was obtained from the ATCC (ATCG #43255) 
and grown under anaerobic conditions in brain-heart infusion medium (BBL). High 
molecular-weight C difficile DNA was isolated essentially as described by Wren and 
Tabaqchaii (1987) J. Clin. Microbiol.. 25:2402. except proteinase K and sodium dodecyl 
sulfate (SDS) was used to disrupt the bacteria, and cctyltrimelhylammonium bromide 
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precipitation [as described in Ausubel et al.. Current Protocols in Molecular Biologj' (1989)] 
was used to remove carbohydrates from the cleared lysaic. The integrity and yield of genomic 
DNA was assessed by comparison with a serial dilution of uncut lambda DNA after 
electrophoresis on an agarose gel. 

fragments I and 2 were cloned by PCR. utilizing a proofreading thermostable DNA 
polymerase (native />/,/ polymerase: Stratagene). The high fidelity of this polymerase reduces 
ihc mutation problems associated with amplification by error prone polymerases (e. K .. Taq 
polymerase). PCR amplification was performed using the indicated PCR primers (Figure 6) 
in 50 nl reactions containing 10 mM Tris-HCI(8.3). 50 mM KCI. 1.5 mM MgCI, 200 uM 
each cINTP. 0.2 uM each primer, and 50 ng C. difficile genomic DNA. Reactions were 
overlaid with 100 M l mineral oil. heated to 94°C for 4 min. 0.5 M l native pju polymerase 
(Stratagene) added, and the reaction cycled 30x at 94°C for 1 min. 50°C for I min. 72°C for 
4 min. |„||„wcd by 10 min at 72°C. Duplicate reactions were pooled, chloroform extracted, 
and ethanol precipitated. After washing in 70% ethanol. the pellets were resuspended in 50 ul 
TI: buffer (10 mM Tris-HCL. 1 mM EDTA pH 8.0]. Aliquots of lOul each were restriction 
digested with either EcoKMHincW (fragment 1 ) or EcoRVP.sH (fragment 2). and the 
appropriate restriction fragments were gel purified using the Prep-A-Gene kit (BioRad). and 
ligatcd to either EcoRUSmal- restricted pGEX2T (Pharmacia) sector (fragment I), or the 
/,'u,R!//>.v/l pMAlc (New Fngland Biolabs) vector (fragment 2). Both clones are predicted to 
produce in-frame fusions with either the glutathione-S-transferase protein (pGF.X vector) or 
(he maltose binding protein (pMAL. vector). Recombinant clones were isolated, and 
confirmed by restriction digestion, using standard recombinant molecular biology techniques. 
ISambrook et al.. Molecular Cloning A Laboratory Manual ( 1989). and designated pGA30- 
660 and P MA660-1 100. respectively (see Figure 6 for description of the clone designations).) 

fragment 3 was cloned from a genomic library of size selected P.stl digested 
C. difficile genomic DNA. using standard molecular biology techniques (Sambrook et ul.). 
Given that the fragment 3 internal />,/! site is protected from cleavage in C. difficile genomic 
DNA [Price e, al.. Curr. Microbiol.. 16:55-60 (1987)]. a 4.7 kb fragment from Ps,\ restricted 
C. di/fici/e genomic DNA was gel purified, and ligated to /».*/! restricted, phosphatase treated 
pUC9 DNA. The resulting genomic library was screened with a oligonucleotide primer 
specific lo fragment 3. and multiple independent clones were isolated. The presence of 
fragment 3 in several of these clones was confirmed by restriction digestion, and a clone of 
the indicated orientation (Figure 6) was restricted with BamUUHindUL the released fracment 
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purified by gel electrophoresis, and iigaied into similarly restricted pET23c expression vector 
ON A (Novagen). Recombinant clones were isolated, and conllrmcd by restriction digestion. 
This construct is predicted to create both a predicted in frame fusion with the pET protein 
leader sequence, as well as a predicted C-terminal poly-hislidinc affinity tag. and is designated 
pPAl 100-2680 (see figure 6 for the clone designation). 



I>) Expression Of Large Fragments Of Toxin A In E. coli 
Protein expression from the three expression constructs made in (a) was induced, and 
analyzed by Western blot analysis with an affinity purified, goat polyclonal antiserum directed 
10 against the toxin A toxoid (Tech Lab). The procedures utilized for protein induction. SDS- 
PAGli. and Western blot analysis arc described in detail in Williams cf al (l ( )95). supra, in 
brief. 5 ml 2X YT (16 g tryptonc. 10 g yeast extract, 5 g NaCI per liter, pi I 7.5 + 100 >ig/ml 
ampicillin were added to cultures of bacteria (BL21 for pMAI and pGHX plasmids. and 
BL21(DL3)l.ysS for pET plasmids) containing the appropriate recombinant clone which were 
15 induced in express recombinant protein by addition of IPTG to 1 mM. Cultures were grown 
ai 37°C. and induced when the cell density reached 0.5 OD, nil . Induced protein was allowed 
to accumulate for two hrs after induction. Protein samples were prepared bv pelletinu 1 ml 
aliquots of bacteria by centrifugation (1 min in a microfuge), and resuspension of the pelleted 
bacteria in 150 ^1 of 2x SOS-PAGE sample buffer [Williams ct al. (IW5). supra]. The 
20 samples were heated to 95°C for 5 min. the cooled and 5 or 10 jil aliquots loaded on 7.5% 
SDS-PAGI: gels. BioRad high molecular weight protein markers were also loaded, to allow 
estimation of the MW of identified fusion proteins. After electrophoresis, protein was 
delected cither generally by staining gels with Coomassie blue, or specifically, by blotting to 
nitrocellulose for Western blot detection of specific immunoreactive protein. Western blots, 
b (performed as described in lixample 3) which detect toxin A reactive protein in cell lysates of 
induced protein from the three expression constructs are shown in Figure 7. In this figure, 
lanes 1-3 contain cell lysates prepared from /:. coli strains containing pPAl 100-2860 in 
Iil21(l)L3)lysK cells: lanes 4-6 contain celllysates prepared from L coli strains containing 
pPAl 100-2860 in BI2l(DL3)lysS cells: lanes 7-9 contain cell lysates prepared from I:, coli 
strains containing pMA30-660: lanes 10-12 contain cell lysates prepared from /•;. coli strains 
containing pMA660-l 100. The lanes were probed with an affinity purified goal antitoxin A 
polyclonal antibody (Tech Lab). Control lysates from uninduced cells (lanes 1. 7. and 10) 
contain very little immunoreactive material compared to the induced samples in the remaining 
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lanes. The highest molecular weight band observed for each clone is consistent with the 
predicted size of the full length fusion protein. 

Each construct directs expression of high molecular weight (HMW) protein that is 
reactive with the toxin A antibody. The size of the largest immunoreactive bands from each 
sample is consistent with predictions of the estimated MW of the intact fusion proteins. This 
demonstrates that the three fusions are in-frame, and that none of the clones contain cloning 
artifacts that disrupt the integrity of the encoded fusion protein. However, the Western blot 
demonstrates that fusion protein from the two larger constructs <pGA30-660 and pPAl 100- 
2680) are highly degraded. Also, expression levels of toxin A proteins from these two 
constructs are low. since induced protein bands are not visible by Coomassic staining (not 
shown). Several other expression constructs that fuse large sub-regions of the toxin A gene to 
cither pMALc or pCT23a-c expression vectors, were constructed and tested for protein 
induction. These constructs were made by mixing gel purified restriction fragments, derived 
from the expression constructs shown in Figure 6. with appropriately cleaved expression 
vectors, ligating. and selecting recombinant clones in which the toxin A restriction fragments 
had hgated together and into the expression vector as predicted for in-frame fusions. The 
expressed toxin A interval within these constructs are shown in Figure 8. as well as the 
internal restriction sites utilized to make these constructs. 

As used herein, the term "interval" refers to any portion U.c. any segment of the toxin 
which is less than the whole toxin molecule) of a clostridial toxin. In a preferred 
embodiment, "interval" refers lo portions off difficile toxins such as toxin A or toxin B. It 
is also contemplated that these intervals will correspond to epitopes of immunologic 
importance, such as antigens or immunogens against which a neutralizing antibody response is 
effected. I. is not intended that the present invention be limited to the particular intervals or 
-sequences described in these Examples, it is also contemplated that sub-portions of intervals 
ie.f!.. an epitope contained within one interval or which bridges multiple intervals) be used as 
compositions and in the methods of the present invention. 

In all cases. Western blot analysis of each of these constructs with goat antitoxin A 
antibody (Tech Lab) detected HMW fusion protein of the predicted size (no. shown,. This 
confirms that the reading frame of each of these clones is no. prematurely terminated, and is 
fused in the correct frame with the fusion partner. However, the Western blot analysis 
revealed thai in all cases, the induced protein is highly degraded, and. as assessed by the 
absence of identifiable induced protein bands by Coomassic Blue staining, are expressed only 
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at low levels. These results suggest that expression of high levels of intact toxin A 
recombinant protein is not possible when large regions of the toxin A gene are expressed in 
E. coli using these expression vectors. 

c) High Level Expression Of Small Toxin A Protein Fusions in 
E. coli 

Experience indicates that expression difficulties are often encountered when large 
(greater than 100 kc!) fragments are expressed in /;. coli, A number of expression constructs 
containing smaller fragments of the toxin A gene were constructed, to determine if small 
regions of the gene can be expressed to high levels without extensive protein degradation. A 
summary of these expression constructs are shown in Figure 9. All were constructed by in- 
frame fusions of convenient toxin A restriction fragments to either the pMALc or pET23a-c 
vectors. Protein preparations from induced cultures of each of these constructs were analyzed 
by both Coomassie Blue staining and Western analysis as in (b) above. In all cases, higher 
levels of intact, full length fusion proteins were observed than with the larger recombinants 
from section (b). 



il) Purification Of Recombinant Toxin A Protein 

Large scale (500 ml) cultures of each recombinant from (c) were grown, induced, and 
soluble and insoluble protein fractions were isolated. The soluble protein extracts were 
affinity chromatographed to isolate recombinant fusion protein, as described [Williams ct al. 
(1094). supray In brief, extracts containing lagged pI-T fusions were chromatographed on a 
nickel chelate column, and elutcd using imidazole salts as described by the distributor 
(Novagen). Extracts containing soluble pMAL fusion protein were prepared and 
chromatographed in column buffer ( 10 mM NaPO,, 0.5M NaCL 10 mM (i-mereaptoethanol. 
pH 7.2) over an amylose resin column (New England Biolabs). and elutcd with column 
buffer containing 10 mM maltose as described [Williams a al. (1995). supra\. When the 
expressed protein was found to be predominantly insoluble, insoluble protein extracts were 
prepared by the method described in Example 17. infra. The results are summarized in Tabic 
16. figure 10 shows the sample purifications of recombinant toxin A protein. In this figure, 
lanes I and 2 contain MBP fusion protein purified by affinity purification of soluble protein. 
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TABLE 16 

Purification Of Recombinant Toxin A Protein 



Clone , '" 


Protein 
Solubility 


Yield Affinity 
Purified Soluble 
Protein 00 


% Intact Soluble 
Fusion Protein ,c> 


Yield Intact 
Insoluble Fusion 
Protein 


pMA30-270 


Soluble 


4 mg/500 mis 


10% 


NIA 


PMAjO-300 


Soluble 


4 mg/500 mis 


5-10% 


KJ A 


pMA300-66() 


insoluble 




NA 


i \j me/ juu m I 


pMA66()-l 100 


Soluble 


4.5 mg/500 mis 


50% 


MA 


pMA 1100-1610 


Soluble 


18 mg/500 mis 


10% 


NA 


pMA16IO-l870 


Both 


22 mg/500 mis 


90% 


20 mg/500 ml 


pMAI450-I870 


insoluble 




NA 


0.2 mg'500 ml 


pPAl 100-1450 


Soluble 


0. 1 mg/500 mis 


90% 


NA 


pI'AI 100-1870 


Soluble 


0.02 mg/500 mis 


90% 


NA 


pMA 1870-2680 


Both 


12 mg/500 mis 


80% 


NA 


pPa 18 70-2680 


Insoluble 




NA 


10 mg/500 ml 



2 P p PCT23 vector. pM-pMALc vector, A=toxin A. 

Based on 1.5 OD : , M - I mg/nil (extinction coefficient of MBP>. 
Intimated by Coomassie staining of SDS-PAGE uels. 

Lanes 3 and 4 contain MBP fusion protein purified by solubilization of insoluble inclusion 
bodies. The purified fusion protein samples are pMAI 870-2680 (lane I ). pM.A66()-l 100 (lane 
2). pMA.100-600 (lane 3) and pMAI450-I87() (lane 4). 

Poor yields of affinity purified protein were obtained when poly-histidine tagged pET 
vectors were used to drive expression (pPAl 100-1450. pPl 100-1870). However, significant 
protein yields were obtained from pMAI. expression constructs spanning the entire toxin A 
gene, and yields of full-length soluble fusion protein ranged from an estimated 200-400 
Mg/500 ml culture (pMA30-300) to greater than 20 mg/500 ml culture (pMA16IO-1870). 
Only one interval was expressed to high levels as strictly insoluble protein (pMA300-660). 
Thus, although high level expression was not observed when using large expression constructs 
Mom the toxin A gene, usable levels of recombinant protein spanning the entire toxin A gene 
were obtainable by isolating induced protein from a series of smaller pMAI. expression 
constructs that span the entire toxin A gene. This is the first demonstration of the feasibility 
of expressing recombinant toxin A protein to high levels in £ a>li. 
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c) Hemagglutination Assay Using The Toxin A Recombinant 
Proteins 

The carboxy terminal end consisting of the repeating units contains the 
hemagglutination activity or binding domain of ( '. difficile toxin A. To determine whether 
5 the expressed toxin A recombinants retain functional activity, hemagglutination assays were 

performed. Two toxin A recombinant proteins, one containing the binding domain as either 
soluble affinity purified protein (pMA 1 870-2680) or SDS solubilized inclusion body protein 
(pPA I 870-2680) and soluble protein from one region outside that domain (pMAI 100-1610) 
were tested using a described procedure. [H.C Krivan a. *//.. Infect. Immun.. 53:573 

10 (1986). | Citrated rabbit red blood cells (RRBO(Cocalico) were washed several times with 
Tris-buffer ( 0.1M Tris and 50 mM NaCI ) by centrifugation at 450 x g for 10 minutes at 4° 
C. A l% RRBC suspension was made from the packed cells and resuspended in T ris-buffer. 
Dilutions of the recombinant proteins and native toxin A (Tech Labs) were made in the Tris- 
buffer and added in duplicate to a round-bottomed %-weil microliter plate in a final volume 

1 5 of I00 M l. To each well. 50 pa I of the l% RRBC suspension was added, mixed by gentle 
tapping, and incubated at 4°C for 3-4 hours. Significant hemagglutination occurred only in 
the recombinant proteins containing the binding domain (pMA 1 870-2680) and native toxin 
A. The recombinant protein outside the binding domain (pMA 1 1 00- 1 61 0) displayed no 
hemagglutination activity. Using equivalent protein concentrations, the hemagglutination titer 

20 for toxin A was 1:256. while titers for the soluble and insoluble recombinant proteins of the 
binding domain were 1:256 and about 1:5000. Clearly, the recombinant proteins tested 
retained functional activity and were able to bind RRBC's. 



EXAMPLE 12 

runciional Activity Of IgY Reactive Against Toxin A Recombinants 

I he expression of recombinant toxin A protein as multiple fragments in L.coli has 
demonstrated the feasibility of generating toxin A antigen through use of recombinant 
methodologies (Example 1 1 ), The isolation of these recombinant proteins allows the 
immunoreaetivity of each individual subrcgion of the toxin A protein to be determined {i.e.. 
in a antibody pool directed against the native toxin A protein). This identifies the regions (if 
any) for which little or no antibody response is elicited when the whole protein is used as a 
tmmunogen. Antibodies directed against specific fragments of the toxin A protein can be 
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purified by af finity chromatography against recombinant toxin A protein, and tested for 
neutralization ability. This identifies any toxin A subregions tha, are essential for producing 
neutralizing antibodies. Comparison with the levels of immune response directed auains, ^ 
these intervals when native toxin is used as an immunogen predicts whether potentially higher 
uters o, neutralizing antibodies can be produced by using recombinant protein directed against 
a .ndmduul region, rather than the entire protein. Finally, since i, is unknown whether 
-mooches reactive to the recombinant toxin A proteins produced in Example 11 neutralize 
«ox.n A as elfectivelv as antibodies raised against native toxin A (Samples 9 and 10) the 
protective ability of a poo. of antibodies affinity purified against recombinant toxin A 
Iragments was assessed for its ability to neutralize toxin A. 

This Example involved (a) epitope mapping of the toxin A protein to determine the 
u. specfic antibodies directed against individual subregions of the toxin A protein when 
nat.ve loxm A protein is used as an immunogen. (b) affinity purification of | ttY reactive 
a,a,nst recombinant proteins spanning the toxin A gene, (c, toxin A neutralization assavs with 
»"■"»>• l^nhed IgY reactive to recombinant toxin A protein to identify subreuions of the 

A protein that induce the production of neutralizing antibodies, and determination of 
whether complete neutralization of toxin A can be elicited with a mixture of antibodies 
reactive to recombinant toxin A protein. 

«) Epitope Mapping Of The Toxin A (ienc 

The aflin.ty purification of recombinant toxin A protein specific to defined intervals of 
'he tox.n A protein allows epitope mapping of antibody pools directed auains, native toxin A 
I l»s lu, no, previously been possible, since previous expression of toxin A recombinants has 
heen assessed only by Western bio, analysis, without knowledge of the expression levels of 
■he protan f,,,.. von Eichel-Streibcr « a,, j. Gen. Microbiol.. 135:55-64 ( ,989)). Thus hi»h 
or low reaenvty of recombinant toxin A protein on Western blots mav reflect protein 
expression level differences, not immunoreactivity differences. Given that the purified 
recombinant protein generated in Example I 1 have been quantised, the issue of relative 
-mmunoreactivitv of individual regions of the toxin A protein was precisely addressed 

< <"• the purposes of this Example, the toxin A protein was subdivided into 6 intervals 
(1-6). numbered from the amino (interval I) to the carboxyl (interval 6, termin, 

The recombinant proteins corresponding ,„ these intervals were from expression clones 
<m*. hxample 1 1(d) for clone designations, pMA30-3O0 (interval 1 ,. P MA30*660 (interval 
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2). pMA660-l 100 (interval 3). pPAl 100-1450 (interval 4). pMA1450-l870 (interval 5) and 
pMA 1 870-2680 (interval 6). These 6 clones were selected because they span the entire 
protein from amino acids numbered 30 through 2680. and subdivide the protein into 6 small 
intervals. Also, the carbohydrate binding repeat interval is contained specificallv in one 

5 interval (interval 6). allowing evaluation of the immune response specifically directed aizainst 
this region. Western blots of 7.5% SDS-PAGR gels, loaded and eleclrophoresed with defined 
quantities of each recombinant protein, were probed with either gout antitoxin A polyclonal 
antibody ( Tech Lab) or chicken antitoxin A polyclonal antibody |pC.TA IgY. Hxample K(c)]. 
The blots were prepared and developed with alkaline phosphatase as previously described 

0 [Williams c/ al (1095). supra]. At least 90% of all reactivity, in cither goal or chicken 

antibody pools, was found to be directed against the ligand binding domain (interval 6). The 
remaining immunorcactivity was directed against all five remaining intervals, and was similar 
in both antibody pools, except that the chicken antibody showed a much lower reactivitv 
against interval 2 than the goat antibody. 

5 This clearly demonstrates that when native toxin A is used as an immunogen in goats 

or chickens, the bulk of the immune response is directed against the ligand binding domain of 
the protein, with the remaining response distributed throughout the remaining 2/3 of the 
protein. 

0 b) Affinity Purification Of IgY Reactive Against Recombinant 

Toxin A Protein 

Affinity columns, containing recombinant toxin A protein from the 6 defined intervals 
in (a) above, were made and used to (i) affinity purify antibodies reactive to each individual 
interval from the CTA IgY preparation | Example 8(c)|. and (ii) deplete interval specific 

5 antibodies from the CTA IgY preparation. Affinity columns were made by coupling 1 ml of 
PBS-washed Actigel resin (Sterogene) with region specific protein and 1/10 final volume of 
Aid-coupling solution (IM sodium eyanoborohydride). The total region specific protein 
added to each reaction mixture was 2.7 mg (interval 1). 3 mg (intervals 2 and 3). 0.1 nig 
(interval 4). 0.2 mg (interval 5) and 4 mg (interval 6). Protein for intervals 1. 3. and 6 was 

0 affinity purified pMAI fusion protein in column buffer (see lixample 1 I). Interval 4 was 
affinity purified poly-hislidine containing pET fusion in PBS: intervals 2 and 5 were from 
inclusion body preparations of insoluble pMAI. fusion protein, dialy/ed extensivelv in IM3S. 
Alkjuots of the supcrnatants from the coupling reactions, before and after coupling, were 
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assessed by Coomassie staining of 7.5% SDS-PAGE gels. Based on protein band intensities, 
in all cases greater than 50% coupling efficiencies were estimated. The resins were poured 
into 5 ml BioRad columns, washed extensively with PBS. and stored at 4°C. 

Aliquots of the C I" A IgY polyclonal antibody preparation were depleted for each 
individual region as described below. A 20 ml sample of the CTA IgY preparation [Sample 
X(c)| was dialy/cd extensively against 3 changes of PBS ( I liter for each dialysis), quantitated 
by absorbance at 01),,,,. and stored at 4°C. Six I ml aliquots of the dialyzed IgY preparation 
were removed, and depleted individually for each of the six intervals. Kach 1 ml aliquot was 
passed over the appropriate affinity column, and the eluate twice reapplied to the column. 
The eluate was collected, and pooled with a I ml PBS wash. Bound antibody was eluted 
IVom the column by washing with 5 column volumes of 4 M Cuanidine-MCI (in 10 mM Tris- 
MCI. pll 8.0). The column was reequilibrated in PBS. and the depleted antibody stock 
reapplied as described above. The eluate was collected, pooled with a 1 ml PBS wash, 
quantised by absorbance at OD, w . and stored at 4° C. |„ lh is manner. 6 aliquots of the CTA 
IgY preparation were individually depleted for each of the 6 toxin A intervals, by two rounds 
ol affinity depletion. The specificity of each depleted stock was tested by Western blot 
analysis. Multiple 7.5% SDS-PAGIi gels were loaded with protein samples corresponding to 
all A toxin A subregions. After electrophoresis, the gels were blotted, and protein transfer 
continued by Ponceau S staining (protocols described in Williams a al. (1995). supra]. After 
blocking the blots I hr at 2()°C in PBS + 0.1% Tween 20 (PBST) containing 5% milk (as a 
blocking buffer). 4 mi of either a 1/500 dilution of the dialyzed CTA IgY preparation in 
blocking buffer, or an equivalent amount of the six depleted antibody stocks (using OD, s „ io 
standardize antibody concentration) were added and the blots incubated a further I hr at room 
temperature. The blots were washed and developed with alkaline phosphatase (using a rabbit 
anti-chicken alkaline phosphate conjugate as a secondary antibody) as previously described 
IWilliams c, al. (1995). supra). In all eases, only the target interval was depleted for antibody 
reactiviiy. and at least 90% of the reactivity to the target intervals was specifically depleted. 

Kegion specific antibody pools were isolated by affinity chromatography as described 
below, fen mis of the dialyzed CTA IgY preparation were applied sequentially to each 
affinity column, such that a single 10 ml aliquot was used to isolate region specific antibodies 
specific to each of the six subregions. The columns were sequentially washed with 10 
volumes of PBS. 6 volumes of BBS-Tween. 10 volumes of TBS. and eluted with 4 ml 
Actisep elulion media (Sterogene). The eluate was dialyzed extensively against several 
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changes of PBS. and the affinity purified antibody collected and stored at 4°C. The volumes 
of the eluate increased to greater than 10 mis during dialysis in each case, due to the high 
viscosity of the Actisep clution media. Aliquots of each sample were 20x concentrated using 
Centricon .10 microconcentrators (Amicon) and stored at 4°(\ The specificity of each region 
specific antibody pool was tested, relative to the dialyzed C I A IgY preparation, by Western 
blot analysis, exactly as described above, except that 4 ml samples of blocking buffer 
containing 100 M | region specific antibody (unconccntrated) were used instead of the depleted 
CI A IgY preparations. Each affinity purified antibody preparation was specific to the defined 
interval, except that samples purified against intervals 1-5 also reacted with interval 6. This 
may be clue to non-specific binding to the interval 6 protein, since this protein contains the 
repetitive ligand binding domain which has been shown to bind antibodies nonspecifically. 
|Lyerh ct a/.. Curr. Microbiol.. I9:30.V306 (1989).| 

fhe reactivity of each affinity purified antibody preparation to the corresponding 
proteins was approximately the same as the reactivity of the diluted dialyzed CTA IgY 
preparation standard. Given that the specific antibody slocks were diluted 1/40. this would 
indicate that the unconcentrated affinity purified antibody stocks contain 1/10-1/20 the 
concentration of specific antibodies relative to the starling C I A IgY preparation. 

c) Toxin A Neutralization Assay Using Antibodies Reactive 
Toward Recombinant Toxin A Protein 

flic CIIO toxin neutralization assay [Example 8(d)| was used to assess the ability of 
the depleted or enriched samples generated in (b) above to neutralize the cytotoxicity of toxin 
A. The general ability of affinity purified antibodies to neutralize toxin A was assessed by 
mixing together aiiquols of all 6 concentrated stocks of the 6 affinity purified samples 
generated in (b) above, and testing the ability of this mixture to neutralize a toxin A 
concentration of 0.1 M g/ml. The results, shown in Figure 11. demonstrate almost complete 
neutralization of toxin A using the affinity purified (AP) mix. Some epitopes within the 
recombinant proteins utilized for affinity purification were probably lost when the proteins 
were denatured before affinity purification [by (iuanidine-1 ICI treatment in <b) above). Thus, 
the neutralization ability of antibodies directed against recombinant protein is probably 
underestimated using these affinity purified antibody pools. I bis experiment demonstrates 
that antibodies reactive to recombinant toxin A can neutralize cytotoxicity, suggesting that 
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neutralizing antibodies may be generated by using recombinant toxin A protein as 
iminunogen. 

In view of the observation that the recombinant expression clones of the toxin A gene 
divide the protein into 6 subregions. the neutralizing ability of antibodies directed against each 
individual region was assessed. The neutralizing ability of antibodies directed against the 
hgand binding domain of toxin A was determined first. 

In the toxin neutralization experiment shown in Figure II. interval 6 specific 
antibodies (interval 6 contains the ligand binding domain) were depleted from .he dialyzed 
I'F.G preparation, and the effect on toxin neutralization assayed. Interval 6 antibodies were 
depleted either by utilizing the interval 6 depleted CTA IgY preparation from (b) above ("-6 
alT. depleted" in Figure II). or by addition of interval 6 protein to the CTA IgY preparation 
(estimated to be a 10 fold molar excess over ami-interval 6 immunoglobulin present in this 
prepara.ion, to competitively compete for interval o proton ("-6 prot depleted" in Figure II). 
In both instances, removal of interval 6 specific antibodies reduces the neutralization 
efficiency relative to the starting CTA IgY preparation. This demonstrates that antibodies 
directed against interval 6 contribute to toxin neutralization. Since interval 6 corresponds to 
Hie ligand binding domain of the protein, these results demonstrate that antibodies directed 
against this region in the PF.G preparation contribute to the neutralization of toxin A in this 
assay. However, it is significant that after removal of these antibodies, the PLC. preparation 
retains significant ability to neutralize toxin A (Figure II). This neutralization is probably 
due to the action of antibodies specific to other regions of the toxin A protein, since at least 
<K>% „f .he ligand binding region reactive antibodies were removed in the depleted sample 
prepared in (b) above. This conclusion was supported by comparison of the toxin 
neutralization of the affinity purified (AP) mix compared to affinity purified interval t, 
antibody alone. Although some neutralization ability was observed with Al> interval 6 
antibodies alone, the neutralization was significantly less than that observed with the mixture 
of all 6 AP antibody stocks (not shown). 

(liven that the mix of all six affinity purified samples almost completely neutralized 
Hie cytotoxicity of toxin A (Figure 1 I ). the relative importance of antibodies directed against 
toxin A intervals 1-5 within the mixture was determined. This was assessed in two way's. 
First, samples containing affinity purified antibodies representing 5 t ,( i| 1c f) inlerva , s werc 
prepared, such, that each individual region was depleted from one sample. Figure 12 
demonstrates a sample neutralization curve, comparing the neutralization ability of affinity 
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purified antibody mixes without interval 4 (-4) or 5 (-5) specific antibodies, relative to the 
mix of all 6 affinity purified antibody stocks (positive control). While the removal of interval 
5 specific antibodies had no effect on toxin neutralization (or intervals 1-3. not shown), the 
loss of interval 4 specific antibodies significantly reduced toxin neutralization (figure 12). 

Similar results were seen in a second experiment, in which affinity purified antibodies, 
directed against a single region, were added to interval 6 specific antibodies, and the effects 
on toxin neutralization assessed. Only interval 4 specific antibodies significantly enhanced 
neutralization when added to interval 6 specific antibodies ( figure 13). These results 
demonstrate that antibodies directed against interval 4 (corresponding to clone pPAl 100-1450 
in Eiyure <)) are important for neutralization of cytotoxicity in this assay. Hpitope mapping 
has shown that only low levels of antibodies reactive to this region are generated when native 
mxin A is used as an immunogen (Example 12(a)|. It is hypothesized that immunization with 
recombinant protein specific to this interval will elicit higher tilers of neutralizing antibodies. 
In summary, this analysis has identified two critical regions of the toxin A protein against 
which neutralizing antibodies ate produced, as assayed by the CHO neutralization assay. 

EXAMPLE 13 

Production And Evaluation Of Avian Antitoxin 
Against ('. difficile Recombinant Toxin A Polypeptide 

In Example 12. we demonstrated neutralization of toxin A mediated cytotoxicity by 
affinity purified antibodies reactive to recombinant toxin A protein. To determine whether 
antibodies i^iiscd against a recombinant polypeptide fragment of ( difficile toxin A may be 
ellective in treating clostridial diseases, antibodies to recombinant toxin A protein representing 
the binding domain were generated. Two toxin A binding domain recombinant polypeptides, 
expressing the binding domain in either the pMALc ( pM A 1 870-2680) or pLT 23(pPAI870- 
2680) vector, were used as immunogens. The pMAE protein was affinity purified as a soluble 
product lExample 12(d)] and the pET protein was isolated as insoluble inclusion bodies 
| Example 12(d)] and solubilized to an immunologically active protein using a proprietary- 
method described in a pending patent application (U.S. Patent Application Serial No. 
OX/!2'J.027). This Example involves (a) immunization, (h) antitoxin collection, (c) 
determination of antitoxin antibody titer, (d) anti-recombinant toxin A neutralization of toxin 
A hemagglutination activity in vino, and (e) assay of in vitro toxin A neutralizing activity. 
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The soluble and the inclusion body preparations each were used separately to 
immunize hens. Both purified toxin A polypeptides were diluted in PBS and emulsified with 
approximately equal volumes of CFA for the initial immunization or IFA for subsequent 
booster immunizations. On day zero, for each of the recombinant preparations, two egg 
laying white Leghorn hens (obtained from local breeder) were each injected a. multiple sites 
(intramuscular and subcutaneous) with I ml of recombinant adjuvant mixture containing 
approximately 0.5 to 1.5 mgs of recombinant toxin A. Booster immunizations of 1.0 mg 
were given on days 1 4 and dav 28. 



b) Antitoxin Collection 

Total yolk immune IgY was extracted as described in the standard PFG protocol (as in 
Sample I , and the final IgY pellet was dissolved in sterile PBS at the original yolk volume. 
I his material is designated "immune recombinant IgY" or "immune IgY." 

«■') Antitoxin Antibody Titer 

To determine if the recombinant toxin A protein was sufficiently immunogenic to raise 
antibodies in hens, the antibody titer of a recombinant toxin A polypeptide was determined bv 
ELISA. liggs from both hens were collected on day .12. the yolks pooled and the antibody " 
was isolated using PEG as described. The immune recombinant IgY antibody titer was 
determined lor the soluble recombinant protein containing the maltose binding protein fusion 
generated in p-Mal ( P MA 1 870-2680). Nincty-six well Falcon Pro-bind plates were coated 
"vemight at 4°C with .00 M | /well of toxin A recombinant at 2.5 ug , u | it! PBS containing 
0.05%, thimerosal. Another plate was also coated with maltose binding protein (MBP) at the 
same concentration, to permit comparison of antibody reactivity to the fusion partner. The 
next day. the wells were blocked with PBS containing 1% bovine serum albumin (USA) for I 
hour a, 37-C. IgY isolated from immune or preimmune eggs was diluted in antibody diluent 
<I>US containing 1% BSA and 0.05% Tween-20). and added to the blocked wells and 
incubated for I hour a. 37'C . The plates were washed three times with PBS with 0.05% 
Tween-20. then three times with PBS. Alkaline phosphatase conjugated rabbit antt-chicken 
IgG (Sigma, diluted 1:1000 in antibody diluent was added to the plate, and incubated for 1 
hour at 37°C. The plates were washed as before and substrate was added, [p-nitrophenyl 
Phosphate (Sigma)| at I mg/ml in 0.05M Na : CO, pH 9.5 and 10 mM MgCh. The plates 
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were evaluated quantitatively on a Dynatech MR 300 Micro IIPA plate reader at 410 nm 
about 10 minutes after the addition of substrate. 

Based on these ELISA results, high antibody titers were raised in chickens immunized 
with the toxin A recombinant polypeptide. The recombinant appeared to be highly 
immunogenic, as it was able to generate high antibody titers relatively quickly with few 
immunizations. Immune IgY tiler directed specifically to the toxin A portion of the 
recombinant was higher than the immune IgY titer to its fusion partner, the maltose bindinu 
protein, and significantly higher than the prcimnume IgY. I.: LISA liters (reciprocal of the 
highest dilution of IgY generating a signal) in the preimmune IgY to the MBP or the 
recombinant was <I:30 while the immune IgY titers to MBP and the toxin A recombinant 
were 1:18750 and > 1:93750 respectively. Importantly, the anti-reeombinam antibody tilers 
generated in the hens against the recombinant polypeptide is much higher, compared to 
antibodies to that region raised using native toxin A. The recombinant antibody titer to 
region 1870-2680 in the CTA antibody preparation is at least five-fold lower compared to the 
recombinant generated antibodies (1:18750 versus ->l :93750V Thus, ii appears a belter 
immune response can be generated against a specific recombinant using that recombinant as 
the immunogen compared to the native toxin A. 

This observation is significant, as it shows that because recombinant portions stimulate 
the production of antibodies, it is not necessary to use native toxin molecules to produce 
antitoxin preparations. Thus, the problems associated with the toxicity of the native toxin are 
avoided and large-scale antitoxin production is facilitated. 

d) Anti-Uccombinant Toxin A Neutralization Of Toxin A 
Hemagglutination Activity In Vitro 

Toxin A has hemagglutinating activity besides cytotoxic and enlerotoxin properties. 
Specifically, toxin A agglutinates rabbit erythrocytes by binding lo a trisaccharide (gal 1-3B1- 
4GlcNAc) on the cell surface. fH. fclrivan et al.. Infect. Immun.. 53:573-581 (1986V| We 
examined whether the anti-recombinant toxin A (immune IgY. antibodies raised against the 
insoluble product expressed in pET) can neutralize the hemagglutination activity of toxin A in 
vitro. The hemagglutination assay procedure used was described by Il.C. Krivan at al. 
Polyethylene gtycol-fraciionatcd immune or preimmune IgY were pre-absorbed with citrated 
rabbit erythrocytes prior to performing the hemagglutination assay because we have found that 
IgY alone can agglutinate red blood cells. Citrated rabbit red blood cells (RRBC'sKCoealico) 
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were washed twice by centrifugation at 450 x g with isotonic buffer (0.1 M Tris-HCI. 0.05 M 
NuCI. pH 7.2). RRBC-rcactive antibodies in the IgY were removed by preparing a 10% 
RRBC suspension (made by adding packed cells to immune or preimmune IgY) and 
incubating the mixture for I hour at 37°C. The RRBCs were then removed by centrifugation. 
Neutralization of the hemagglutination activity of toxin A by antibody was tested in round- 
bottomed %-well microliter plates. Twenty-five p.1 of toxin A (36 ug /ml) (Tech Lab) in 
isotonic buffer was mixed with an equal volume of different dilutions of immune or 
preimmune IgY in isotonic buffer, and incubated for 15 minutes at room temperature. Then. 
50 Ml of a 1% RRBC .suspension in isotonic buffer was added and the mixture was incubated 
lor 3 hours at 4°C. Positive control wells containing the final concentration of 9 M g/ml of 
toxin A alter dilution without IgY were also included. Hemagglutination activity was 
assessed visually, with a diffuse matrix of RRBCs coating the bottom of the well 
representing a positive hemagglutination reaction and a tight button of RRBCs at the bottom 
of the well representing a negative reaction. The anti-recombinani immune IgY neutralized 
toxin A hemagglutination activity, giving a neutralization titer of 1:8. However, preimmune 
IgY was unable to neutralize the hemagglutination ability of toxin A. 

e) Assay Of //; Vitro Toxin A Neutralizing Activity 

The ability of the anti-recombinant toxin A IgY (immune IgY antibodies raised against 
pMA 18711-2680. the soluble recombinant binding domain protein expressed in piVIAL. 
designated as Anli-tox. A-2 in figure 14 . and referred to as recombinant region <» and pre- 
immune IgY. prepared as described in Example 8(c) above, to neutralize the cytotoxic activity 
of toxin A was assessed in vim, using the CUO cell cytotoxicity assay, and toxin A (Tech 
l.ab) at a concentration of 0. lug/ml. as described in Example 8(d) above. As additional 
controls, the anti-naiive toxin A IgY (C.TA) and pre-immune IgY preparations descrihed in 
Lxample 8(c) above were also tested. The results are shown in Figure 14. 

The anti-recombinant toxin A IgY demonstrated only partial neutralization of the 
cytotoxic activity of toxin A. while the pre-immune IgY did not demonstrate any significant 
neuiralizin" activiiv. 
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EXAMPLE 14 

In vivo Neutralization Off. difficile Toxin A 

The ability of avian antibodies (IgY) raised against recombinant toxin A binding 
domain to neutralize the cnterotoxin activity of C difficile toxin A was evaluated /;, vim 
using Golden Syrian hamsters. The Example involved: (a» preparation of the avian ami- 
recombinant toxin A IgY lor oral administration: (b) in vivo protection of hamsters from < 
difficile toxin A entcrotoxicity by treatment of toxin A with avian ami-recombinant toxin A 
IgY: and (c) histologic evaluation of hamster ceca. 

:») Preparation Of The Avian Anti-Rccombinant Toxin A IgY 
For Oral Administration 

Liggs were collected from hens which had been immunized with the recombinant ( ■ 
dif/hi/v toxin A fragment pMA! 870-2680 (described in Example 13. above). A second group 
nl eggs purchased at a local supermarket was used as a pre-immune (negative) control, l-gg 
yolk immunoglobulin (IgY) was extracted by PEG from the two groups of eggs as described 
in Example 8(c). and the final IgY pellets were solubilized in one-fourth the original yolk 
volume using 0. 1 M carbonate buffer (mixture of Nal-ICO, and Na.C'O,). pi I 9.5. The basic 
carbonate buffer was used in order to protect the toxin A from the acidic pi I of the stomach 
environment. 

I» fit vivo Protection Of Hamsters Ajjainst C. difficile Toxin A 
Entcrotoxicity By Treatment Of Toxin A With Avian Anti- 
rccombinant Toxin A IgY 

In order to assess the ability of the avian anti-recombinant toxin A IgY. prepared in 
section (a) above to neutralize the in vivo enlerotoxin activity of toxin A. an in vivo u »xin 
neutralization model was developed using Golden Syrian hamsters. This model was based on 
published values for the minimum amount of toxin A required to elicit diarrhea (0.08 mg 
toxin A/Kg body wt.) and death (0.16 mg toxin A/Kg bod> wu in hamsters when 
administered orally (l.yerly a <//. Infect. Immun.. 47:349-."?52 (1985). 

Tor the study, four separate experimental groups were used, with each group consisting 
of 7 female Golden Syrian hamsters (Charles River), approx. three and one-hall weeks old. 
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weighing approx. 50 gms each. The animals were housed as groups of 3 and 4. and were 
offered food and water ucl libitum through the entire length of the study. 

Vor each animal, a mixture containing either 1 0 M g of toxin A (0.2 mg/K_) or 10 M g of 
toxin A ((,.6 mg/Kg) (C. ,/////,//, toxin A was oblajned from ^ ^ ] m \ Qf ^ ^ 

anu-rccombinant toxin A IgY or pre-immune IgY (from section «a) above) was prepared 
These m ix, urc .s were incubated a, 37°C for 60 min. and were then administered to the animal, 
by iho oral route. The animals were .hen observed for the onset of diarrhea and death for a 
penod ol 24 hrs. following the administration of the toxin A * IgY mixtures, a. the end of 
which time, the following results were tabulated and shown in Table 17: 

TABLE 17 

Study Outcome Al 2-1 Hours 



f-Nperimenlal Group 



Study Outcome ;» 24 Hours 



1-lcalihv' 



Di;irrhc;r 



Dead 



1(1 l " xi " A • Aiiiiioxm Ayainsi Inierval 6 



I) 



f " xin A ' Aniiioxin Against Interval 6 



111 lox '" A • I're-lmmune Serum 



M) iiu I ox in A • Hi e- Immune 



Animals renamed healthy ihrough the entire 24 hour studv period 
Animals developed diarrhea, but did not die. 
Animals developed diarrhea, and subsequently died. 

IVe.rcatmen, of toxin A at both doses tested, using the an.i-rccombinam toxin A IgY 
prevented all over, symptoms of disease in hamsters. Therefore, pretreatment of C ,/,//;„/, 
««x.n A. using the an.i-recombinan, ,oxin A IgY. neutralized the /„ enterotoxm activity 
,h ° l ° X,n A - 1,1 CO,1,rast - aM a,lilllals l-rom the two groups which received toxin A which 
had been predated using pre-immune IgY developed disease symptoms which ranucd from 
"■arrhca to death. The diarrhea which developed in the 5 animals which did not die in each 
"I «he two pre-immune groups, spontaneously resolved by the end of the 24 hr. study period. 

c) Histologic Evaluation Of Hamster Ccca 

In order to further assess the ability of anii-rccombinan. toxin A |, Y to protect 
Hamsters from the endotoxin activity of toxin A. histologic evaluations were performed on 
tiK' ceca ol hamsters from the study described in section (b) above. 

Three groups ol' animals were sacrificed in order to prepare histological specimens 
he first group consisted of a single representative animal taken from each of the 4 groups of 
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surviving hamsters at the conclusion of the study described in section (b) above. These 
animals represented the 24 hr. timepoint of the study. 

The second group consisted of two animals which were not part of the study described 
above, but were separately treated with the same toxin A »■ pre-immune IgY mixtures as 
described for the animals in section (b) above. Both of these hamsters developed diarrhea, 
and were sacrificed 8 hrs. after the lime of administration of the toxin A pre-immune lgY 
mixtures. At the time of sacrifice, both animals were presenting symptoms of diarrhea. 
These animals represented the acute phase of the study. 

The final group consisted of a single untreated hamster from the same shipment of 
animals as those used for the two previous groups. This animal served as the normal control. 

Samples of cecal tissue were removed from the 7 animals described above, and were 
fixed overnight at 4°C using 10% buffered formalin. The fixed tissues were paraffin- 
embedded, sectioned, and mounted on glass microscope slides. The tissue sections were then 
stained using hematoxylin and eosin (II and E stain), and were examined by light microscopy. 

The tissues obtained from the two 24 hr. animals which received mixtures containing 
either lO^g or 30|.ig of toxin A and anti-recombinant toxin A lgY were indistinguishable from 
the normal control, both in terms of gross pathology, as well as at the microscopic level. 
These observations provide further evidence for the ability of anti-recombinam toxin A lgY to 
effectively neutralize the in vivo cmerotoxin activity of C difficile toxin A. and thus its ability 
to prevent acute or lasting toxin A-induced pathology. 

In contrast, the tissues from the two 24 hr. animals which received the toxin A + pre- 
immune lgY mixtures demonstrated significant pathology. In both of these groups, the 
mucosal layer was observed to be less organized than in the normal control tissue. The 
cytoplasm of the epithelial cells bad a vacuolated appearance, and gaps were present between 
the epithelium and the underlying cell layers. The lamina propria was largely absent. 
Intestinal villi and crypts were significantly diminished, and appeared to have been overgrown 
by a planar layer of epithelial cells and fibroblasts. Therefore, although these animals ovcrtlv 
appeared to recover from the acute symptoms of toxin A intoxication, lasting pathologic 
alterations to the cecal mucosa had occurred. 

The tissues obtained from the two acute animals which received mixtures of toxin A 
and pre-immune lgY demonstrated the most significant pathology. At the gross pathological 
level, both animals were observed to have severely distended ceca which were filled with 
watery, diarrhea-like material. At the microscopic level, the animal that was given the 
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mixture containing ,0,g of toxin A and pre-immune IgY was found to have a mucosal lavcr 
wh.ch had a ragged, damaged appearance, and a disorganized, compacted quality. The crvpts 
were largely absent, and numerous breaks in .he epithelium had occurred. There was also ™ 
■nllux «. ery.hrocy.es „ 1t0 spaces between ,he epithelial .aver and the underlying tissue The 
amma. which had received the mixture containing 30ug of toxin A and pre-immune IgY 
demonstrated the most severe pathology. The cecal tissue of this amma. had an appearance 
very s.m.lur to that observed in animals which had died from C. Jifficiic disease. Widespread 
dcs.ruct.on of the mucosa was noted, and .he epithelial layer had sloughed. Hemorrhagic 
-as containing ,arge numbers of erythrocy.es were very prevalent. All semblance of norma. 
— e architecture was absent from .his specimen. ,n ,rms of the presenta.ion of pathologic 
-cms. ,h,s /„ ,„.„ hamster ni()dc| ol . toxjn A . imoxicat| . on corrdaies vcn . dosdv u . h 

Prolog, consequences of C. Jifflalc disease in hams.ers. The resul.s presented in this 
•-ample demonstrate .ha, while anti-rccombinan, toxin A interval 6, , gY ,s capable of on.v 
!-",.!!> neutralizing t he cytotoxic activity of C. toxin A. .he same antibodv 

ollccne.y neutrals .00% of the /„ ,,,, enterotoxin activity of the toxi,, Whiu/i, ,s no. 
'"•ended .ha, „ lis invenlion b , ljmjlcd , 0 ^ ^ ^ ^ ^ ^ ^ 

and cmcrcunxici, v of ( Toxin A as two separate and distinct biological functions. * 

EXAMPLE 15 

//; l ivn Neutralization Of C. Difficile Toxin A Liy 
Antibodies Against Recombinant Toxin A Polypeptides 

The ability of avian antibodies directed against the recombinant C. </,///,//, loxin A 
bagmen, .870-2680 (as expressed by pMA 1 870-2680: see Example .3, neutralize the 
entero.ox.c activity of toxin A was demonstrated in Example .4. The ability of avian 
an-bodies UgYs, directed against other recombinant toxin A epitopes to neutra,i,e native 
■oxm A , a ™ was next evaluated. Phis example involved: (a, the preparation of IgYs 
a-„,t recombinant toxin A polypeptides: (b, ,„ ,,,,, protection of hamsters auains, toxin A 

* -eaunent w,,h ami-recombinan, toxin A IgYs and ,c) quantisation of specie antibodv 
concentration in CTA and Interval 6 IgY PEG preparations. 

SFO m'^'T SeqUC,1CC °'' COdi " y rCSi ° n ° f Cmire "« in A — » « 
■IQ NO:,, fhe amino acid sequence of the entire tox.n A protein is listed in SFQ II) 

• «*• 1 1- am.no acid sequence consisting of amino acid residues .870 throuuh of 
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loxin A is listed in SHQ ID NO:7. The amino acid sequence consisting of amino acid 
residues 1870 through I960 of toxin A is listed in SFQ II) N():8. 

a) Preparation Of IrY\s Against Recombinant Toxin A 
Polypeptides 

Lggs were collected from Leghorn hens which have been immunized with recombinant 
( ' difficile loxin A polypeptide fragments encompassing the entire toxin A protein. The 
polypeptide fragments used as immunogens were; I ) pMA 1870-2680 (Interval 6). 2) pPA 
1 100-1450 (Interval 4). and 3) a mixture of fragments consisting of pMA 30-300 {Interval 1), 
pMA 300-660 (Interval 2), pMA 660-1 100 (Interval 3) and pMA 1450-1870 (Interval 5). 
This mixture of immunouens is referred to as Interval 1235. The location of each interval 
within the toxin A molecule is shown in Figure I5A. In Figure 15A. the foliowinu 
abbreviations are used: pP refers to the pLT23 vector {New Lngland BioLabs): pM refers to 
the piVlAI. IM -e vector (New Lngland BioLabs): A refers to toxin A: the numbers refer to the 
amino acid interval expressed in the clone. (For example, the designation pMA30-3()0 
indicates that the recombinant clone encodes amino acids 30-300 of toxin A and the vector 
used was pMAL IM -c). 

The recombinant proteins were generated as described in Lxample 1 1. The IgYs were 
extracted and solubili/ed in 0.1M carbonate buffer pi I 9.5 for oral administration as described 
in Lxample 14(a). The IgY reactivities against each individual recombinant interval was 
evaluated by LLISA as described in Lxample 13(c), 

l>) //; Vivo Protection Of Hamsters Against Toxin 

A By Treatment With Anti-Kccombinant Toxin A Antibodies 

The ability of antibodies raised against recombinant toxin A polypeptides to provide in 
vivo protection against the enlerotoxic activity of toxin A was examined in the hamster model 
system. This assay was performed as described in Lxample 14(b). Briefly, for each 40-50 
gram female Golden Syrian hamster (Charles River). I ml of IgY 4X (i.e.. resuspended in 1/4 
of the original yolk volume) PEG prep against Interval 6. Interval 4 or Interval 1235 was 

mixed with 30 Mg (1.0 oral dose) of C. difficile toxin A (Tech Lab). Preimmune IgY 

mixed with loxin A served as a negative control. Antibodies raised against ( \ difficile toxoid 
A (Lxample 8) mixed with toxin A (CTA) served as a positive control. The mixture was 
incubated for I hour at 37°C then orally administered to lightly etherized hamsters using an 
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IXG feeding needle. The animals were then observed for the onset of diarrhea and death for 
a period of approximately 24 hours. The results are shown in Table 18. 

TABLE 18 



Treat men i uroup 


Healthy* 


Diarrhea" 


Dead 1 


Pre immune 


0 


0 


7 


OA 


s 


0 


0 


Intervnt ft 


ft 


1 


0 


Interval 4 


0 


1 


ft 


Interval 1235 


0 


0 


7 



' Animal shows no sign of illness. 

Animal developed diarrhea, bin did not die. 
Animal developed diarrhea and died. 

IVe-trcatmcnt of toxin A with IgYs against Interval 6 prevented diarrhea in ft of 7 
hamsters and completely prevented death in all 7. In contrast, as with preimmunc IgY. IgYs 
against Interval 4 and Interval 1235 had no effect on the onset of diarrhea and death in the 
hamsters. 



c) Quantification Of Specific Antibody Concentration In CTA 
And Interval 6 IrY PEG Preparations 

To determine Hie purity of IgY PUG preparations, an aliquot of a pMA 1 870-2680 
(Interval ft) luY PEG preparation was chromatographed using IIPI.C" and a KW-X03 sizing 
column (Shodexi. The resulting profile of absorbancc at 280 nm is shown in figure I ft. The 
single large peak corresponds to the predicted MW of IgY. Integration of the area under the 
single large peak showed that greater than 95% of the protein elutcd from the column was 
present in this single peak. This result demonstrated that the majority (><)S%) 0 f the material 
absorbing at 280 nm in the PEG preparation corresponds to IgY. Therefore, absorbancc at 
280 nm can be used to determine the total antibody concentration in PEC, preparations. 

To determine the concentration of Interval 6-spccific antibodies (expressed as percent 
of total antibody) within the CTA and pMA 1870-2680 (Interval 6) PEG preparations, defined 
quantities of these antibody preparations were affinity purified on a pPA 1 870-2680(1 1) (shown 
schematically in figure 15B) affinity column and the specific antibodies were quantified. In 
figure I5B the following abbreviations arc used: pP refers to the pET23 vector (New England 
liioLabs): P M refers to the pMAI. IM -c vector (New England BioLabs): P G refers to the pGEX 
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vector (Pharmacia): pB refers to the PinPoint™ Xa vector (Promega): A refers to toxin A: the 
numbers refer to the amino acid interval expressed in the clone. The solid black ovals 
represent the MBP: the hatched ovals represent glutathione S-transf'crasc: the hatched circles 
represent the biotin tag: and HUM represents the poly-hislidine tag. 

An affinity column containing recombinant toxin A repeat protein was made as 
follows. Four ml of PDS-washcd Acligel resin (Sterogene) was coupled with 5-10 mg of 
pPA 1 870-2680 inclusion body protein | prepared as described in Fxamplc (17) and dialyzed 
into PBS | in a 15 ml lube (Falcon) containing 1/10 final volume Aid-coupling solution (I M 
sodium cyanoborohydride). Aliquots of the supernatant from the coupling reactions, before 
and after coupling, were assessed by Coomassie staining of 7.5% SDS-PAGF gels. Based 
upon protein hand intensities, greater than 6 mg of recombinant protein was coupled to the 
resin. The resin was poured into a 10 ml column (BioRad). washed extensively with PBS. 
pre-eluted with 4 M guanidine-UCI (in 10 mM Tris-HC'l. pi I 8.1): 0.005% ihimcrosal) and rc- 
equilihrated with PBS. The column was stored at 4T. 

Aliquots of a pMA 1870-2680 (Interval 6) or a CTA IgY polyclonal antibody 
preparation (PFG prep) were affinity purified on the above affinity column as follows. The 
column was attached to an I 'V monitor (ISCO) and washed with PBS. l or pMA 1870-2680 
IgY purification, a 2X Pl.-Ci prep (filter sterilized using a 0.45 p filter: approximately 500 mg 
total IgY) was applied. The column was washed with PBS until the baseline was re- 
established (the column How-through was saved), washed with BBSTween to elute 
nonspeciflcally binding antibodies and re-equilibrated with PBS. Bound antibody was eluted 
from the column in 4 M guanidinc-i IC1 (in 10 mM Tris-IICI. pil 8.0: 0.005% ihimcrosal). 
The entire elulion peak was collected in a 15 ml tube (falcon). The column was re- 
equilibraied and the column eluate was re-chromalographed as described above. The antibody 
preparation was quantified by l!V absorbance (the elution buffer was used to zero the 
spectrophotometer). Total purified antibody was approximately <) mg and I mg from the first 
and second chromatography passes, respectively. The low yield from the second pass 
indicated that most specific antibodies were removed by the llrst round of chromatography. 
I he estimated percentage of Interval 6 specific antibodies in the pMA 1 870-2680 PF.G prep is 
approximately 2%. 

The percentage of Interval 6 specific antibodies in the C TA PF.G prep was determined 
(utilizing the same column and methodology described above) to be approximately 0.5% of 
total ItiY. 
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A 4X PEG prep contains approximately 20 mg/ml IgY. Thus in b) above, 
approximately 400 Mg specific antibody in the Interval 6 PEG prep neutralized 30 ug toxin A 



in vivo. 



EXAMPLE 16 

In Vivo Treatment Of C. difficile Disease In 
Hamsters By Recombinant Interval (i Antibodies 

The ability of antibodies directed against recombinant Interval 6 of toxin A to protect 
hamsters /,, viva from C. diddle disease was examined. This example involved: (a) 
prophylactic treatment of ( '. difficile disease and (b) therapeiuie treatment of ( •. difficile 
disease. 



:») Prophylactic Treatment Of C. difficile Disease 

I bis experiment was performed as described in Example 9(h). Three .roups each 
ci.nsis.imj of 7 female 100 gram Syrian hamsters (Charles River, were prophvlacticallv treated 
with either preimmune l,Ys. IgYs against native toxin A and B [CTAB: see [Example H (a) 
«nd (b)| or IgYs against Interval 6. IgYs were prepared as 4X PHG preparations as described 
m hxample l )(a). 

The animals were orally dosed 3 times daily, roughly at 4 hour intervals, for 12 davs 
«-i'h 1 ml antibody preparations diluted in Ensure*, Using estimates of specific an.ibodv " 
concentration from Example 15(c). each dose of the Interval 6 anlibodv prep contained " 
approximately 400 ug of speeillc antibody. On day 2 each hamster was predisposed to C. 
difficile infection by the oral administration of 3.0 mg of Clindamycin-! ICI (Sigma) in I ml 
of water. On day 3 the banisters were orally challenged with I ml of C. difficile inoculum 
strain ATCC 43596 i„ sterile saline containing approximately 100 organisms. The animals 
were then observed for the onset of diarrhea and subsequent death during U,e treatment 
period. I he results are shown in Table 19. 
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TABLE V) 



Treatment Group 


Number Animals Alive 


Number Animals Dead 


Pre immune 


0 


7 


CTAB 


6 


1 


Inierval 6 


7 


0 



10 



30 



Treatment of hamsters with orally-administered IgYs against Interval 6 successfully 
protected 7 out of 7 (100%) of the animals from ( '. difficile disease. One of the hamsters in 
this group presented with diarrhea which subsequently resolved during the course of 
treatment. As shown previously in Example 9. antibodies to native toxin A and toxin 3 were 
highly protective. In this Example. f> out of 7 animals survived in the CTAB treatment group. 
All of the hamsters treated with preimmune sera came down with diarrhea and died. The 
survivors in both the C TAB and Interval 6 groups remained healthy throughout a 12 day post- 
treatment period. In particular. 6 out of 7 Interval 6-trcated hamsters survived at least 2 
weeks after termination of treatment which suggests that these antibodies provide a long- 
lasting cure. These results represent the first demonstration that antibodies generated against a 
recombinant region of toxin A can prevent CDAI) when administered passively to animals. 
Ihese results also indicate that antibodies raised against Interval 6 alone may he sufficient to 
protect animals from ( difficile disease when administered prophylacticallv. 

Previously others had raised antibodies against toxin A by actively immunizing 
hamsters against a recombinant polypeptide located within the Interval 6 region (Lyerly. 
D.M.. ct ul. (I Wo) Curr. Microbiol. 21:29). Figure 17 shows schematically the location of 
the Lyerly. ct al. intra-lnterval 6 recombinant protein (cloned into the pUC vector) in 
comparison with the complete Interval 6 construct (pM A 1870-2680) used herein to generate 
neutralizing antibodies directed against toxin A. In figure 17. the solid black oval represents 
the MBP which is fused to the toxin A Interval 6 in pM A 1 870-2680. 

I he Lyerly. ct al. antibodies (intra-lnterval 6) were only able to partialis protect 
hamsters against (*. difficile infection in terms of survival (4 out of 8 animals survived) and 
iurthermore. these antibodies did not prevent diarrhea in any of the animals. Additionally, 
animals treated with the intra-lnterval 6 antibodies [Lyerly. ct al. (I WO), supra] died when 
treatment was removed. 

In contrast, the experiment shown above demonstrates that passive administration of 
anti-Interval 6 antibodies prevented diarrhea in 6 out of 7 animals and completely prevented 



- 98 - 



WO 98/08540 PCT7US97/15394 

death due to CDAD. Furthermore, as discussed above, passive administration of the anti- 
Interval 6 antibodies provides a long lasting cure (i.e.. treatment could be withdrawn without 
incident). 



en 

s 



») Therapeutic Treatment Of C. difficile Disease: In Vivo 
Treatment Of An Established C difficile Infection In 
Hamsters With Recombinant Interval 6 Antibodies 

The ability of antibodies against recombinant interval 6 of toxin A to therapeutically 
feat t \ difficile disease was examined. The experiment was performed essentially as 
described in Kxample 10(h). Three groups, each containing seven to eiuht female Gold 
Syrian hamsters ( .(»() g each: Charles River) were treated with either preimmune IgY. IgY 
agams. native toxin A and toxin B (CTAB) and IgYs against Interval (, The antibodies were 
prepared as described above as 4X PFG preparations. 

The hamsters were first predisposed to ('. difficile infection with a .1 mi! dose of 
C-lindamycin-MC'l (Sigma, administered orally in I ml of water. Approximately 24 hrs later, 
the animals were orally challenged with I ml of C. difficile strain ATCC 435% ,„ ster ile 
saline containing approximately 200 organisms. One day after infection, the presence of toxin 
A and \\ uas determined in the feces of the hamsters using a commercial immunoassay kit 
(Cytoclone A+B LPA. Cambridge Biotech) to verify establishment of infection, four 
.^embers of each group were randomly selected and tested. Feces from an uninfected hamster 
was tested as a negative control. All infected animals tested positive for the presence of toxin 
according to the manufacturer's procedure. The initiation of treatment then started 
approximatelv 24 hr post-infection. 

The animals were dosed daily a. roughly 4 hr intervals with 1 ml antibody preparation 
diluted in linsurc* (Ross Labs). The amount of specific antibodies given per dose 
(determined by affinity purification) was estimated to be about 400 M g of anti-Interval 6 |n Y 
for animals in the Interval 6 group) and 100 ug and 70 pg of an.i-u.xin A (Interval 6- 
specific) and anti-toxin B (Interval ^-specific: see Example 19). respectively, for the CTAB 
preparation. The animals were treated for 9 days and then observed lor an additional 4 davs 
'<»■ the presence of diarrhea and death. The results indicating the number of survivors and the 
number of dead 4 days post-in lection lire shown in Table 20. 
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TABLE 20 



In vivo Therapeutic Treatment With Interval 6 Antibodies 



Treatment Group 


Number Animals Alive 


Number Animals Dead 


Preimmune 


4 


3 


CI AB 


8 


0 


Interval 6 


8 


(1 



Antibodies directed against both Interval 6 and CTAli successfully prevented death 
from ( ' difficile when therapeutically administered 24 hr after infection. This result is 
significant since many investigators begin therapeutic treatment of hamsters with existing 
drugs (t*. t*. . vancomycin, phenelfamycins. tiacumicins. etc.) 8 hr post-infection [Swanson. vt 
tti ( 1 Wl ) Antimicrobial Agents and Chemotherapy 35:1 108 and ( J. Antibiotics 42:94). 

forty-two percent of hamsters treated with preimmune lgY died from CDAI). While 
the anti-Interval 6 antibodies prevented death in the treated hamsters, they did not eliminate 
all symptoms of CDAD as 3 animals presented with slight diarrhea. In addition, one CTAB- 
trealcd and one preimmune-treated animal also had diarrhea 14 days post-infection. These 
results indicate that anti-Interval 6 antibodies provide an effective means of therapy for 
CIMl). 



-0 EXAMPLE 17 

Induction Of Toxin A Neutralizing Antibodies Requires Soluble Interval o Protein 



As shown in Examples i 1(d) and 15. expression of recombinant proteins in /:". coli 
may result in the production of either soluble or insoluble protein. If insoluble protein is 
produced, the recombinant protein is solubilizcd prior to immunization of animals. To 
determine whether, one or both of the soluble or insoluble recombinant proteins could be used 
to generate neutralizing antibodies to toxin A. the following experiment was performed. This 
example involved a) expression of the toxin A repeats and subj'ragments of these repeats in E. 
coli using a variety of expression vectors: b) identification of recombinant toxin A repeats and 
sub-regions to which neutralizing antibodies bind: and c) determination of the neutralization 
ability of antibodies raised against soluble and insoluble toxin A repeat immunogen. 
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«) Expression Of The Toxin A Repeats And Subfragments Of 
These Repeats In £ coli Using A Variety Of Express! n 
Vectors 

The Interval 6 immunogen utilized in Examples 15 ami 16 was the pMAl 870-?680 
pmiem. m which the toxin A repeats arc expressed as a solub.e fusion protein with the MBP 
(desenbed in Example 1 .). Interestingly, expression of .his region (from the .V„d , ite t0 the 
end o. the repeats, see Figure I SB) in three other expression constructs, as either native 
<pl>AIK7()-2680i. poly-l.is tagged [ P PA1870-2680 (II)] or biotm-tauged <pBA1870-">680) 
proteins resulted in completely insoluble protein upon induction of the bacterial host (see 
j^urc .>»>. The host strain B..2. (Novagen, was used tor expression of pBA 1 870-2680 and 

«n.in BL2KDFJ) (Novagen) was used for express,,,,, of pl>A, 870-2680 and pPA.870- 
2680,.,,. , hcsc insoluble p| , tcjns 1K , umulatcd u> ,, gh lcvds jn inc]us . on b()d(cs _^ 

»l recombinant p.asmids in E. ,„/, host cells grown in 2X YT medium was performed as 
described | Williams, ci al. (1095). supra]. 

As summarized in Figure I5B. expression of fragments of the toxin A repeats (as 
cher terminal . V ,,-A,„, v | fragment, or C-.erminal /uv,RI-end fragments, also vie.ded 
High levels o, insoluble protein using pCil-X (pGA 1 870-2 1 90). PinPoint™-Xa (pBA I870-' > I9() 
«nd PHA2250-2680, and pl:T expression systems (pPAl 870-2190). The pGFX and P ET 
expression systems are described in Example ... The PinPoin,- Xa expression svs.em drives 
the expression of fusion proteins in L ,„//. Fusion proteins from PinPoint'^-Xa vectors 
contam a biotin tag at the amino-terminal end and can be affinity purified Softl.ink"' Soft 
Kclease av.din resin (Promega) under mild denaturing conditions (5 n,M biotin,. 

The solubility of expressed proteins from the pPG 1 870-2 190 and pPA 1 870-^1 90 
expression constructs was determined after induction of recombinan, protein expression under 
enndmons reported to enhance protein solubility fThese conditions comprise .rowth of the 
Host a, reduced temperature , 3 <rC) and the utilization of high (I mM IPTG) iow (0 , mM 
H>rC„ concentrations of inducer [Williams, a,. (.995). .„„■„,. A „ exprc , scd 
-x,n A protein was insoluble under these conditions. Thus, expression of these fragments of 

^ A ,ePeatS in PET md "«* vectors results in the production of inso.uble 

•ecombmant protein even when the bos, ce.ls are grown a, reduced temperature and usinu 
'o-r concentrations of the inducer. Although expression of these fragments in pMa. vectors 
> -Ided affinity purillab.e solub.e fusion protein, the protein was either predominant 
-./uble (pMA,870-2,9(„ or unstable ( P MA2250-2650,. Attempts to solubiiize expressed 
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protein from the pMA 1870-2 190 expression construct using reduced temperature or lower 
inducer concentration (as described above) did not improve fusion protein solubility. 

Collectively, these results demonstrate that expression of the toxin A repeat region in 
Fi. coli results in the production of insoluble recombinant protein, when expressed as either 
large (aa 1870-2680) or small <aa 1870-2190 or aa 2250-2680) fragments, in a variety of 
expression vectors (native or poly-his lagged pET. pGLX or PinPoint IM - Xa vectors), utilizing 
growth conditions shown to enhance protein solubility. The exception to this rule were 
fusions with the MBI\ which enhanced protein solubility, either partially (pMAI 870-2190) or 
fully <pMA 1 870-2680). 

I)) Identification Of Recombinant Toxin A Repeats And Sub- 
Regions To Which Neutralizing Antibodies Bind 

Toxin A repeat regions to which neutralizing antibodies bind were identified bv 
utilizing recombinant toxin A repeat region proteins expressed as soluble or insoluble proteins 
to deplete protective antibodies from a polyclonal pool of antibodies against native ('. difficile 
toxin A. An /// vivo assay was developed to evaluate proteins for the ability to bind 
neutralizing antibodies. 

The rational for this assay is as follows. Recombinant proteins were first pre-mixed 
with antibodies against native toxin A (CTA antibody: generated in Kxample 8) and allowed 
to react. Subsequently. ('. difficile toxin A was added at a concentration lethal to hamsters 
and the mixture was administered to hamsters via IP injection. If the recombinant protein 
contains neutralizing epitopes, the CTA antibodies would lose their ability to bind toxin A 
resulting in diarrhea and/or death of the hamsters. 

The assay was performed as follows. The lethal dose of toxin A when delivered orally 
to nine 40 to 50 g Golden Syrian hamsters (Sasco) was determined to be 10 to 30 (.ig. The 
PFXi-purilled CTA antibody preparation was diluted to 0.5X concentration [i.e.. the antibodies 
were diluted at twice the original yolk volume) in 0.1 M carbonate buffer. pH 9.5. The 
antibodies were diluted in carbonate buffer to protect them from acid degradation in the 
stomach. The concentration of 0.5X was used because it was found to be the lowest effective 
concentration against toxin A. '['lie concentration of Interval 6-specific antibodies in the 0.5X 
CTA prep was estimated to be 10-15 ^ig/ml (estimated using the method described in 
Lxample 15). 
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The inclusion body preparation [insoluble Interval 6 protein: pPAI870-2680(H)] and 
ihe soluble Interval 6 protein (pMA 1 870-2680: see Figure 13] were both compared tor their 
ability io bind to neutralizing antibodies against C. difficile toxin A (CTA). Specifically. I to 
2 my of recombinant protein was mixed with 5 ml of a 0.5X CTA antibody prep (estimated 
lo contain 60-70 ug of Interval 6-specillc antibody). After incubation for I |, r at 37°C. CTA 
(Tech Lab) a. a final concentration of 30 ng/ml was added and incubated for another 1 hr at 
.>7°C One ml of this mixture containing 30 ug of toxin A (and 10-15 M g of Interval 6- 
specilie antibody, was administered orally to 40-50 g Golden Syrian hamsters (Sasco). 
Recombinant proteins that result in the loss of neutralizing capacity of the CTA an.ibodv 
would indicate that those proteins contain neutralizing epitopes. Prcimmune and CTA 
antibodies (both at 0.5X) without the addition of any recombinant protein served as negative 
;md positive controls, respectively. 

Two other inclusion body preparations, both expressed as insoluble products in the 
Pl-T vector, were tested: one containing a different insert (toxin B fraument) other than 
Interval 6 called P PB 1 850-2070 (see Figure .8) which serves as a control for insoluble 
Interv al 6. the other was a truncated version of the Interval 6 region called pl»A 1 870-2 1 90 
(see f igure I5B). The results of this experiment arc shown in Table 21. 

TABLE 21 



Treatment Group' 


miwi * ill \f | 

Healthy' 


wiciii .imuv WUttOllie 

Diarrhea'' 


A Her 24 Hours 
Dead' 


Pre immune Ab 


0 






CTA Ab 




1 


(i 


C TA Ab - int 6 (soluble) 


1 


2 


T 


CTA Ab - lot 6 (insoluble) 


S 


0 


0 


CTA Ab ■* P PB 1850-2070 


5 


0 


0 


CTA Ab * pPA I87Q-2 1 90 


5 


0 


0 



<'. diffidiv toxin A (CTA) was added to each group. 
Animals showed no siyns of illness. 
Animals developed diarrhea but did not die. 
Animals developed diarrhea and died. 

I'rcimmune antibody was ineffective against toxin A. while anti-CTA antibodies at a 
dilute 0.5X concentration almost completely protected the hamsters against the enterotoxic 
effects of CTA. The addition of recombinant proteins pPBl 850-2070 or pl>A 1870-2 190 to 
Hw anti-CTA antibody had no effect upon its protective ability, indicating thai these 
recombinant proteins do not hind to neutralizing antibodies. On .he other hand, recombinant 
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Interval 6 protein was able to bind to neutralizing anti-CTA antibodies and neutralized the in 
vivo protective effect of the anti-CTA antibodies. Tour out of live animals in the group 
treated with anti-CTA antibodies mixed with soluble interval 6 protein exhibited toxin 
associated toxicity (diarrhea and death). Moreover, the results showed that Interval 6 protein 
must be expressed as a soluble product in order for it to bind to neutralizing anti-C TA 
antibodies since the addition of insoluble Interval 6 protein had no effect on the neutralizing 
capacity of the CTA antibody prep. 

c) Determination Of Neutralization Ability Of Antibodies 
Raised Against Soluble And Insoluble Toxin A Repeat 
Immunogcn 

To determine if neutralizing antibodies are induced against solubilized inclusion 
bodies, insoluble toxin A repeat protein was solubilized and specific antibodies were raised in 
chickens. Insoluble pPA 1870-2680 protein was solubilized using the method described in 
Williams cf a/. {1995). supra. Briefly, induced cultures (500 ml) were pelleted bv 
ceniiifugution at 3.000 X g for 10 min at 4°(\ The cell pellets were resuspended thoroughly 
in 10 ml of inclusion body sonication buffer (25 mM HliPES pi I 7.7. 100 niM KC1. 12.5 mM 
MgCL 20% glycerol. 0,1% (v/v) Nonidet P-40. I mM DT I ). The suspension was transferred 
to a 30 ml non-glass centrifuge lube. I-'ive hundred M l of 10 mg/ml lysozyme was added and 
the tubes were incubated on ice for 30 min. The suspension was then frozen at -70°C for at 
least I nr. The suspension was thawed rapidly in a water bath at room temperature and then 
placed on ice. The suspension was then sonicated using at least eight 15 sec bursts of the 
microprobe (Branson Sonicator Model No. 450) with intermittent cooling on ice. 

I he sonicated suspension was transferred to a 35 ml Oakridgc tube and cemrifuged at 
6.000 X g for 10 min at 4°C to pellet the inclusion bodies. The pellet was washed 2 limes by 
pipetting or vortexing in fresh, ice-cold RIPA buffer |().1% SDS. 1% Triton X-100. 1% 
sodium deoxycholate in TBS (25 mM Tris-Cl pH 7.5. 150 mM NaCl)|. The inclusion bodies 
were recentrifuged after each wash. The inclusion bodies were dried and transferred usinu a 
small metal spatula to a 15 ml tube (falcon). One ml of 10% SDS was added and the pellet 
was solubilized by gently pipetting the solution up and down using a 1 ml micropipettor. The 
solubilization was facilitated by heating the sample to 95°C when necessary. 

Once the inclusion bodies were in solution, the samples were diluted with 9 volumes 
of PBS. The protein solutions were dialyzed overnight against a 1 00-fold volume ol" PBS 
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containing 0.05% SDS at room temperature. The dialysis buffer was then changed to PBS 
containing 0.01% SDS and the .samples were dialyzed for several hours to overnight at room 
temperature. The samples were stored at 4°C until used. Prior to further use. the samples 
were warmed to room temperature to allow any precipitated SDS to go back into solution. 

The inclusion body preparation was used to immunize hens. The protein was dialyzed 
into PBS and emulsified with approximately equal volumes of CPA for the initial 
imnu.niza.ion or IF A for subsequent booster immunizations. On day zero, (breach of the 
recombinant recombinant preparations, two egg laying white Leghorn hens were each injected 
«« multiple si.es (IM and SC, with 1 ml of recombinant protein-adjuvant mixture contains 
approximately 0.5-1.5 mg of recombinant protein. Booster immunizat.ons of I 0 mu were * 
given of days 14 and day 28. liggs were collected on day .12 and the antibodv isolated usin. 
H:C as described in Example 14(a). High titers of toxin A specific antibodies were present 
<as assayed by lil.lSA. using the method described in Example 1.1). Titers were determined 
l»r both an.ibod.es against recombinant polypeptides pPA 1 870-2680 and pMA 1 870-2680 and 
were found to be comparable at > 1:62.500. 

Antibodies against soluble Interval 6 (pMA 1 870-2680) and insoluble Interval 6 
Inclusion body,. pPAl 870-2680| were tested for neutralizing ability against ,oxin A usina 
the /„ r/,v, assay described in P.xample 15(b). Preimmune antibodies and antibodies against 
i«x.n A (C I A) served as negative and positive controls, respectively. The results are shown 
in Table 22. 



TABLE 22 



1 N "'" ; " i *" i "' °'' A B> Annbn.,,, Amnios. Soluble in,™,. * n,„,.„ ,,,,,„ 


Amibody Treatment Group 


Health) ' 


Diarrhea' 


Dead 1 


I'reimmunc 


1 


0 


A 


CTA 


5 


0 


0 


Interval 6 (Soluble)' 


s 


0 


0 


Interval 6 (Insoluble) I 0 1 


.) 



Animals showed no siim of illness. 
Animal developed diarrhea but did not die. 
Animal developed diarrhea and died 
•100 uu ml. 

Antibodies raised against native toxin A were protective while preimmune antibodies 
had h.tle effect. As found using the in vino CHO assay [described in Example 8(d)| where 
antibodies raised against the soluble Interval 6 could partially neutralize the effects of toxin A. 
here they were able to completely neutralize toxin A in vivo. |„ contrast, the antibodies 
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raised against the insoluble Interval 6 was unable to neutralize the effects of toxin A in vivo 
as shown above (Table 22) and in vitro as shown in the CHO assay [described in Example 
8(d)]. 

These results demonstrate that soluble toxin A repeat immunogen is necessary to 
5 induce the production of neutralizing antibodies in chickens, and that the generation of such 
soluble immunogen is obtained only with a specific expression vector (pMal) containing the 
toxin A region spanning aa 1870-2680. That is to say. insoluble protein thai is subsequently 
solubilized does not result in a toxin A antigen that will elicit a neutralizing antibody. 

•° EXAMPLE 18 

Cloning And Expression Of The C. difficile Toxin B Gene 



I he toxin B gene has been cloned and sequenced: the amino acid sequence deduced 
from the cloned nucleotide sequence predicts a MW of 269.7 k[) lor toxin B |Barroso et a/.. 
Nucl. Acids Res. 18:4004 (1990)). The nucleotide sequence of the coding region of the entire 
toxin \\ gene is listed in SEQ 10 NO:9. The amino acid sequence of the entire toxin B 
protein is listed in SEQ ID NO: 10. The amino acid sequence consisting of amino acid 
residues 1850 through 2360 of toxin B is listed in SEQ ID NO: I I. The amino acid sequence 
consisting of amino acid residues 1750 through 2360 of toxin B is listed in SEQ ID NO:12. 

(liven the expense and difficulty of isolating native toxin B protein, it would be 
advantageous to use simple and inexpensive procaryotic expression systems to produce and 
purify high levels of recombinant toxin B protein for immunization purposes, Ideallv. the 
isolated recombinant protein would be soluble in order to preserve native antigenicity, since 
solubili/.ed inclusion body proteins often do not fold into native conformations. Indeed as 
shown in Example 17. neutralizing antibodies against recombinant toxin A were only obtained 
when soluble recombinant toxin A polypeptides were used as the immunogen. To allow ease 
ol purification, the recombinant protein should be expressed to levels greater than I mg/liter 
of l\ co/i culture. 

Io determine whether high levels of recombinant toxin B protein could be produced in 
/:. coli. fragments of the toxin B gene were cloned into various prokaryotic expression 
vectors, and assessed for the ability to express recombinant toxin B protein in colt. This 
Example involved (a) cloning of the toxin B gene and (h) expression of the toxin B gene in 
/:. co/i. 
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a) Cloning Of The Toxin B Gene 

The toxin B gene was cloned using PCR amplification from C. difficile genomic DNA. 
Initially, the gene was cloned in two overlapping fragments, using primer pairs P5/P6 and 
P7/P8. The location of these primers along the toxin 13 gene is shown schematically in Figure 
18. The sequence of each of these primers is: P5: 5' TAGAAAAAATGGCAAATGT 3* (SEQ 
II) NO: 1 1 ): Pf>: 5" TTTCATCTTOTA GAGTCAAAG .V (SEQ ID N0:I2): 
P7: 5" GATGCCACAAGATGATTTAGTG .V (SEQ ID NO: 13): and 
P8: 5" C TAATTGAGCTGTATCAGGATC 3" (SEQ ID NO: 14). 

Figure 18 also shows the location of the following primers along the loxin B gene: P9 
which consists of the sequence 5" CGGAATTCCTAGAAAAAATGGCAA ATG y (SEQ ID 
NO: IS): Pl() which consists of the sequence 5" GCTCTAGAATGA CC AT A AGCTAGCCA 
3* (SEQ ID NO.I6): PI I which consists of the sequence 

5' CGGAATTCGAGTTGGTAGAAAGGTGGA .V (SEQ ID NO: 1 7): P13 which consists of 
Hie sequence y CGGAAT TCGGTTATTATCTTAAGGATG 3" (SEQ ID NO: 18): and PI4 
which consists of the sequence V CGGAATTCTTGATAACTGGAT TTGTGAC .V (.SEQ ID 
NO: 19). I he amino acid sequence consisting of amino acid residues 1852 through 2362 of 
toxin B is listed in SEQ ID NO:2(). The amino acid sequence consisting of amino acid 
residues 1755 through 2362 of toxin B is listed in SEQ ID N():21. 

Clostridium difficile VPI strain 10463 was obtained from the American Type Culture 
Collection </\TCC 43255) and grown under anaerobic conditions in brain-hean infusion 
medium (Becton Dickinson). High molecular-weighl C. difficile DNA was isolated essentially 
as described [Wren and Tabaqchali (1987) J. Clin. Microbiol.. 25:24021. except 1) 100 U g/ m | 
proteinase K in 0.5% SDS was used to disrupt the bacteria and 2) ce.y.rimethylammonium 
bromide (( TAB) precipitation jas described by Ausuhcl el a/.. Eds.. Current Pnuocols in 
Molecular Bi,,lo K v. Vol. 2 (1989) Current Protocols) was used to remove carbohydrates from 
the cleared lysatc. Briefly, after disrupiion of the bacteria with proteinase K and SDS. the 
solution is adjusted to approximately 0.7 M NaCI by the addition of a 1/7 volume of 5M 
NaCI. A 1/10 volume of CTAB/NaCl (10% CTAB in 0.7 M NaCI) solution was added and 
the solution was mixed thoroughly and incubated 10 min at 65°C. An equal volume of 
ehloroform/isoamyl alcohol (24:1) was added and the phases were thoroughly mixed. The 
organic and aqueous phases were separated by centrifugal ion in a microfuge for 5 min. The 
aqueous supernatant was removed and extracted with phenol/chloroform/ isoamyl alcohol 
(2^:24:1 ). The phases were separated by centrifugation in a microfuge for 5 min. The 
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supernatant was transferred to a fresh tube and the DNA was precipitated with isopropanol. 
The DNA precipitate was pelleted by brief centrifugal ion in a microfugc. The DNA pellet 
was washed with 70% elhanol to remove residual CTAU. The DNA pellet was then dried 
and redissolvcd in TK buffer (1(1 mM Tris-HCI pH8.0. I mM KDTA). The integrity and 
yield of genomic DNA was assessed by comparison with a serial dilution of uncut lambda 
DNA alter electrophoresis on an agarose gel. 

Toxin B fragments were cloned by PCR utilizing a proofreading thermostable DNA 
polymerase |native I'fi, polymerase (Slratagene)|. The high fidelity of this polymerase 
reduces the mutation problems associated with amplification by error prone polymerases (e.f-. 
/'<«/ polymerase). PCR amplification was performed using the PCR primer pairs P5 (SKO ID 
NO: 1 1 ) with |>6 (SIIQ ID N():12) and l>7 (Sl-Q ID N0:I3) with P8 (Sl-Q ID N0:I4> in 50 
Hi reactions containing 10 mM Tris-HCI pH8.3. 50 mM KCI. 1.5 mM MgCI.. 200 ^M of 
each dNTP. 0.2 uM each primer, and 50 ng C. difficile genomic DNA. Reactions were 
overlaid with 100 ul mineral oil. healed to 94°C for 4 min. 0.5ul native Pfn polymerase 
(Stratagem.-) was added, and the reactions were cycled 30 times at l M°C for 1 min. 50°C for | 
mm. 72"C (2 min for each kb of sequence to be amplified), followed by 10 min at 72°C. 
Duplicate reactions were pooled, chloroform extracted, and ethanol precipitated. After 
washing in 70% ethanol. the pellets were resuspended in 50 (il TK buffer (10 mM Tris-HCI 
pH8.0. I mM HDTA). 

The P5/P6 amplification product was cloned into pUCI') as outlined below. I(>ul 
aliquots of DNA were gel purified using the Prep-a-(iene kit (BioRad). and ligated to Smcl 
restricted pUCl') vector. Recombinant clones were isolated and confirmed hv restriction 
digestion using standard recombinant molecular biology techniques (Sambrook <•/ ,//.. 1 0X9). 
Inserts from two independent isolates were identified in which the toxin B insert was oriented 
such that the vector /Will and Sad sites were 5' and 3" oriented, respectively (pUCBIO- 
1530). The insert-containing BamHUSucl fragment was cloned into similarly cut pHT23a-e 
vector DNA. and protein expression was induced in small scale cultures (5 ml) of identified 
clones. 

Total protein extracts were isolated, resolved on SDS-PACiL gels, and toxin B protein 
identified by Western analysis utilizing a goat anti-toxin B affinity purified antibody (Tech 
Lab). Procedures for protein induction. SDS-PAGL. and Western blot analysis were 
performed as described in Williams ct ul. (1W5). .«/,»•«. In brief. 5 ml cultures of bacteria 
grown in 2XYT containing 100 ug/ml ampicillin containing the appropriate recombinant clone 
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were induced to express recombinant protein by addition of FPTG to ImM. The E. coli hosts 
used were: BL2KDE3) or BL21(DE3)LysS (Novagen) for pET plasmids. 

Cultures were induced by the addition of IPTG to a final concentration of 1.0 mM 
when the cell density reached 0.5 OD WM) . and induced protein was allowed to accumulate for 
two h,s after induction. Protein samples were prepared by pelleting I ml aliquots of bacteria 
by eentrifugation ( I min in microfuge). and resuspension of the pelleted bacteria in 150 ul of 
2X SDS-PAGE sample buffer (0.125 mM Tris-HCI pH 6.8. 2 mM EDTA. 6% SDS. 20% 
glycerol. 0.025% bromophenol blue: p-mercaptoethanol is added to 5% before use). The 
samples were heated to «5°C for 5 min. then cooled and 5 or 10 uls loaded on 7.5% SDS- 
PAGE gels. High molecular weight protein markers (BioRad) were also loaded, to allow 
estimation of the MW of identified fusion proteins. After electrophoresis, protein was 
detected either generally by staining the gels with Coomassie Blue, or specifically, by blotting 
to nitrocellulose for Western blot detection of specific immunoreactive protein. The MW of 
induced toxin B reactive protein allowed the integrity of the toxin I) reading frame to be 
determined. 

The P IT23b recombinant (pPB 10-1530) expressed high MW recombinant toxin B 
reactive protein, consistent with predicted values. This confirmed that frame terminating 
errors had no, occurred during PCR amplification. A P ET23b expression clone containing the 
H)-1750aa interval of the toxin B gene was constructed, by fusion of the EcoRV-Spel 
fragment of the P7/P8 amplification product to the P>£«,RV interval of the P5/P6 
amplification product (see figure 18) in pPBIO-1530. The integrity of this clone <pPB10- 
1750) was confirmed by restriction mapping, and Western blot detection of expressed 
recombinant toxin R protein. Levels of induced protein from both pPBIO-1530 and pPBlO- 
1750 were too low to facilitate purification of usable amounts of recombinant toxin B protein. 
The remaining 1750-2360 aa interval was directly cloned into pMAL or pET expression 
vectors from the P7/P8 amplification product as described below. 

n) Expression Of The Toxin I! Gene 

i) Overview Of Expression Methodologies 

Fragments of the toxin B gene were expressed as either native or fusion proteins in E. 
coli. Native proteins were expressed in either the P ET23a-c or pETI6b expression vectors 
(Novagen). The pET23 vectors contain an extensive polylinker sequence in all three reading 
frames (a-c vectors) followed by a C-terminai ooly-histidine repeal. The P ET16b vector 
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contains a N-terminal poly-histidine sequence immediately 5' to a small polylinker. The 
poly-histidine sequence binds to Ni-Chelate columns and allows affinity purification of tagged 
target proteins [Williams et al. (1995), supra]. These affinity tags arc small (10 aa for 
pET16b. 6 aa for pET23) allowing the expression and affinity purification of native proteins 
5 with only limited amounts of foreign sequences. 

An N-terminal histidine-tagged derivative of pETI6b containing an extensive cloninu 
cassette was constructed to facilitate cloning of N-terminal poly-histidine tagged toxin B 
expressing constructs. This was accomplished by replacement of the promoter region of the 
pET23a and b vectors with that of the pET16b expression vector. Each vector was restricted 

10 with Bi:IU and AVAd. and the reactions resolved on a 1.2 % agarose gel. The pET16b 

promoter region (contained in a 200 bp BxlU-Ndel fragment) and the promoter-less pET23 a 
or b vectors were cut from the gel. mixed and Prep-A-(ienc ( BioRad) purified. The eluted 
DNA was ligated. and transformanis screened lor promoter replacement by Aod digestion of 
purified plasmid DNA (the pET16b promoter contains this site, the pET23 promoter does 

1 5 not). These clones (denoted pETHisa or b) contain the pETlfib promoter (consisting of a 

pI7-lac promoter, ribosome binding site and poly-histidine (lOaa) sequence) fused at the Nclc\ 
site to the extensive pET23 poiylinker. 

All MBP fusion proteins were constructed and expressed in the pMAI. m -c or 
pMAL. IM -p2 vectors (New England Biolabs) in which the protein of interest is expressed as a 

20 C-ierminal fusion with MBP. All pET plasmids were expressed in either the B1.21(I)E3) or 
BL21(I)E3)I.ysS expression hosts, while pMal plasmids were expressed in the 131.21 host. 

Large scale (500 mis to I liter) cultures of each recombinant were grown in 2X YT 
broth, induced, and soluble protein fractions were isolated as described | Williams, el al. 
(1995). .viy;/v/|. The soluble protein extracts were affinity chromatographed to isolate 

25 recombinant fusion protein, as described (Williams et «/.. (1995) supra]. In brief, extracts 

containing tagged pET fusions were chromatographed on a nickel chelate column, and eluted 
using imidazole salts or low pi I (pi I 4.0) as described by the distributor (Novagcn or Qiagen). 
Extracts containing soluble pMAL fusion protein were prepared and chromatographed in PBS 
buffer over an amy lose resin (New England Biolabs) column, and eluted with PBS containing 

30 10 mM maltose as described (Williams et al. (1995). supra\. 
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ii) Overview Of Toxin B Expression 

In both large expression constructs described in (a) above, only low level (i.e.. less 
lhan I mg/liter of intact or nondegraded recombinant protein) expression of recombinant 
protein was detected. A number of expression constructs containing smaller fragments of the 
toxin B gene were then constructed. 10 determine if small regions of the gene can be 
expressed to high levels </.«,.. greater than I mg/liter intact protein) without extensive protein 
degradation. All were constructed by in frame fusions of convenient toxin B restriction 
fragments ,n either the pMAL-c. P LT23a-c. pLT16b or pETIIisa-b expression vectors, or bv 
engineering restriction sites at specific locations using PCR amplification [usinu the same 
conditions described in (a) above|. In all cases, clones were verified by restriction mapping, 
and. where indicated. DNA sequencing. 

Protein preparations Iron, induced cultures of each of these constructs were analyzed. 
h.v SDS-PACil-. to estimate protein stability (Coomassie Blue staining) and immunoreaenvi.v 
uguinsi : ,nti-t„xin B specific antiserum ( Western analysis). I liyher levels of intact </.e. . 
nondegraded). full length fusion proteins were observed with the smaller constructs as 
compared with fhc larger recombinants, and a series of expression constructs spanning the 
entire toxin U gene were constructed (figures 18. 19 and 20 and Table 23). 

Constructs that expressed significant levels of recombinant toxin B protein (greater 
than I mg/liter intact recombinant protein) in E. coli were identified and a series of these 
clones that spans the toxin B gene are shown in Figure 19 and summarized in Table 23. 
These clones were utilized to isolate pure toxin B recombinant protein from the entire toxm B 
gene. Significant protein yields were obtained from pMAL expression constructs spannine the 
entire toxin B gene, and yields of full length soluble fusion protein ranged from an estimated 
1 mg/liter culture (pMBI 100-1530) to greater than 20 mg/liter culture (pMBI 750-2360). 

Representative purifications of MBP and poly-histidinc-tagged toxin B recombinants 
are shown in Figures 21 and 22. Figure 21 shows a Coomassie Blue stained 7.5% SDS- 
PAGL gel on which various protein samples extracted from bacteria harboring pMBI850- 
2360 were electrophoresed. Samples were loaded as follows: Fane I: protein extracted from 
uninduced culture: Lane 2: induced culture protein: Lane 3; total protein from induced culture 
alter sonication: Lane 4: soluble protein: and Lane 5: eluted affinity purified protein. Figure 
22 depicts the purification of recombinant proteins expressed in bacteria harboring either" 
pPBI850.236(MLones 1-3) or P PB.750-2360 (Lanes 4-6). Samples were loaded as follows- 
uninduced total protein (Lanes 1 and 4): induced total protein (Lanes 2 and 5): and eluted 
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affinity purified protein (Lanes 3 and 6). The broad range molecular weight protein markers 
(BioRad) are shown in Lane 7. 

Thus, although high level expression was not attained using large expression constructs 
from the toxin B gene, usable levels of recombinant protein were obtained by isolating 
induced protein from a series of smaller pMAL expression constructs that span the entire 
toxin B gene. 

These results represent the first demonstration of the feasibility of expressing 
recombinant toxin B protein to high levels in E. coli. As well, expression of small regions of 
the putative ligand binding domain (repeal region) of toxin B as native protein yielded 
insoluble protein, while large constructs, or fusions to MBP were soluble (Figure 19), 
demonstrating that specific methodologies are necessary to produce soluble fusion protein 
from this interval. 



iii) Clone Construction And Expression Details 

A portion of the toxin B gene containing the toxin B repeat region |amino acid 
residues 1852-2362 of toxin B (SHQ ID NO:20)| was isolated by PCR amplification of this 
interval of the toxin B gene from ('. difficile genomic ON A. The sequence, and location 
within the toxin B gene, of the two PCR primers |P7 (SEQ ID NO: 13) and P8 (SKQ ID 
NO:l4)| used to amplify this region are shown in Figure 18. 

DNA from the PCR amplification was purified by chloroform extraction and elhanol 
precipitation as described above. The DNA was restricted with Spvl. and the cleaved DNA 
was resolved by agarose gel electrophoresis. The restriction digestion with Spe\ cleaved the 
3.6 kb amplification product into a 1.8 kb doublet band. This doublet band was cut from the 
gel and mixed with appropriately cut. gel purified pMALc or pFT23b vector. These vectors 
were prepared by digestion with ///Will, filling in the overhanging ends using the Klenow 
enzyme, and cleaving with Xha\ (pMALc) or Nhd (pET23b). The gel purified DNA 
fragments were purified using the Prep-A-Gene kit (BioRad) and the DNA was ligated. 
transformed and putative recombinant clones analyzed by restriction mapping. 

pl*T and pMal clones containing the toxin B repeat insert (aa interval 1750-2360 of 
toxin B) were verified by restriction mapping, using enzymes that cleaved specific sites within 
the toxin B region. In both cases fusion of the toxin B Spc\ site with either the compatible 
Xhal site (pMal) or compatible XIwl sile (pET) is predicted to create an in frame fusion. This 
was confirmed in the case of the pMB 1 750-2360 clone by DNA sequencing of the clone 
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junction and 5* end of the toxin B insert using a MBP specific primer (New England 
Biolabs). In the case of the pET construct, the fusion of the blunt ended toxin B 3' end to 
the tilled HindlU site should create an in-frame fusion with the C-tcrminal poly-histidine 
sequence in this vector. The pPB 1750-2360 clone selected had lost, as predicted, the Hind\\\ 
site at this clone junction: this eliminated the possibility that an additional adenosine residue 
was added to the 3' end of the PCR product by a terminal transferase activity of the Pju 
polymerase, since fusion of this adenosine residue to the tilled HinM\ site would regenerate 
the restriction site (and was observed in several clones). 

One liter cultures of each expression construct were grown, and fusion protein purified 
by affinity chromatography on either an amylose resin column (pMAL constructs: resin 
supplied by New England Biolabs) or Ni-chelate column (pET constructs: resin supplied by 
Qiagen or Novagen) as described | Williams a at. (1995). supra]. The integrity and purity of 
Hie fusion proteins were determined by Coomassie staining of SDS-PAGE protein gels as well 
as Western blot analysis with either an affinity purified goat polyclonal antiserum ( Tech Lab), 
or a chicken polyclonal PEG prep, raised against the toxin B protein (CTB) as described 
above in Example 8. In both cases, affinity purification resulted in yields in excess of 20 mu 
protein per liter culture, of which greater than 90% was estimated to be full-length 
recombinani protein. It should he noted that the poly-histidine affinity tagged protein was 
released from ihc Qiagen Ni-NTA resin at low imidazole concentration (60 mM). 
necessitating the use of a 40 mM imidazole rather than a 60 mM imidazole wash step during 
purification. 

A periplasmically secreted version of pMBl 750-2360 was constructed by replacement 
of the promoter and MBP coding region of this construct with that from a related vector 
(pMAI. IM -p2: New England Biolabs) in which a signal sequence is present at the N-ierminus 
of the MBP. such that fusion protein is exported. This was accomplished by substituting a 
B.alU-LcoRV promoter fragment from P MAL.-p2 into pMBl 750-2360. The yields of secreted, 
affinity purified protein (recovered from osmotic shock extracts as described by Riggs in 
Cttmni Protocols in Molecular Biology. Vol. 2. Ausubel. el «/.. Eds. (1989). C urrent 
Protocols, pp. 16.6.1 - 16.6.14] from this vector (pMBpl 750-2360) were 6.5 mg/liter culture, 
of which 50% was estimated to be full-length fusion protein. 

Ihc interval was also expressed in two non-overlapping fragments. pMB! 750-1970 
was constructed by introduction of a frameshift into pMB 1 750-2360. by restriction with 
IfindlU. filling in the overhanging ends and religalion of the plasmid. Recombinani clones 
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were selected by loss of the HirtMl site, and further restriction map analysis. Recombinant 
protein expression from this vector was more than 20 mg/liter of greater than 90% pure 
protein. 

The complementary region was expressed in pMB 1 970-2360. This construct was 
created by removal of the 1750-1970 interval of pMR 1 750-2360. This was accomplished by 
restriction of this plasmid with EcoRl (in the pMalc polylinker 5" to the insert) and III. filling 
in the ov erhanging ends, and religation of the plasmid. The resultant plasmid. pMB 1 970-2360. 
w as made using both inlracellularly and secreted versions of the pMB 1 750-2360 vector. 

No fusion protein was secreted in the pMBpl 970-2360 version, perhaps due to a 
conformational constraint that prevents export of the fusion protein. However, the 
inlracellularly expressed vector produced greater than 40mg/liter of greater than 90% full- 
length fusion protein. 

C onstructs to precisely express the toxin H repeals in either pMalc (pMBl 850-2360) or 
pl-TI6h (pl>B 1 850-2360) were constructed as follows. The DNA interval including the toxin 
B repeats was PCR amplified as described above utilizing PCR primers IM4 (SKQ II) NO:!9) 
and PX (SKQ ID NO: 14). Primer P14 adds a Ka,R\ site immediately Hanking the start of the 
toxin B repeats. 

The amplified fragment was cloned into the pT7 Blue T- vector (Novagen) and 
recombinant clones in which single copies of the PCR fragment were inserted in either 
orientation were selected ( P T71 850-2360) and conf.rmed by restriction mapping. The insert 
was excised from two appropriately oriented independently isolated pT7 1 850-2360 plasmids. 
with liwRl (5- end of repeats) and !>xt\ (in the Hanking polylinker of the vector), and cloned 
into KcoRVP.srt cleaved pMalc vector. The resulting construct (pMB 1 850-2360) was 
confirmed by restriction analysis, and yielded 20 mg/1 of soluble fusion protein [greater than 
90% intact (/.<?.. nondegraded)J after affinity chromatography. Restriction of this plasmid with 
Hiiul\\\ and religation of the vector resulted in the removal of the 1970-2360 interval. The 
resultant construct ( P MBI850-I970) expressed greater than 70 mg/liter of 90% lull length 
fusion protein. 
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The pPBI 850-2360 construct was made by cloning a EcoRl (filled with Klenow)- 
BamHl fragment from a pT7 1850-2360 vector (opposite orientation to that used in the 
pMB 1850-2360 construction) into Mel (MedySamHl cleaved pET16b vector. Yields of 
affinity purified soluble fusion protein were 15 mg/li.er. of greater than 90% full | ength 
fusion protein. 

Several smaller expression constructs from the 1750-2070 interval were also 
constructed in Mis-tagged pET vectors, but expression of these plasmids in the BI.21 (DE3) 
host resulted in the production of high levels of mostly insoluble protein (sec Table 23 and 
Figure 19). These constructs were made as follows. 

PPBI850-I970 was constructed by cloning a Bf>l\\-Hmc{l\\ fragment of pPB 1850-2360 
into BamiHimm cleaved P LT23b vector. pPBI 850-2070 was constructed by cloning a 
/k/II-/V„II fragment of pPBI 850-2360 into B^lWHincU cleaved pET23b vector. pPBI750- 
1970(c) was constructed by removal of the internal HindXU fragment of a pPBI 750-2360 
vector in which the vector HincAU site was regenerated during cloning (presumably bv the 
addition of an A residue to the amplified PCR product by terminal transferase activity of Pfi, 
polymerase). The P PBI750-I970(n) construct was made by insertion of the insert containing 
the Nc/el-Hinc/Ul fragment of pPB 1 750-2360 into identically cleaved pETHisb vector. All 
constructs were confirmed by restriction digestion. 

An expression construct that directs expression of the 10-470 aa interval of toxin B 
was constructed ,n the pMalc vector as follows. A Nhel (a site 5" to the insert in the pET23 
vector, ,^//|| ( n || ed) {ragmenI ofthe loxm g gene ffom pPBI(M530 ^ ^ 

U-omnaiible with SMVHindM (filled, pMalc vector. The integrity of the construct (pMBIO- 
470) was verified by restriction mapping and DNA sequencing ofthe 5" clone junction using 
a MBP specific DNA primer ,New England Biolabs). However, all expressed protein was 
degraded to the MBP monomer MW. 

A second construct spanning this interval (aa 10-470) was constructed by cloning the 
PCR amplification product from a reaction containing the P9 (SEQ ID NO: 1 5) and P10 (SFQ 
ID N():16, primers (Figure .8) into the pETHisa vector. This was accomplished bv cloning 
the PCR product as an /;„,RI-blunt fragment into EnM-Hinc\\ restricted vector DNA: 
recombinant clones were verified by restriction mapping. Although this construct (pPBIO- 
520) allowed expression and purification (utilizing the N-lerminal polyhistidine affinity tau, of 
intact fusion protein, yields were estimated at less than 500 ug per liter culture. 
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Higher yield of recombinant protein from this interval (aa 10-520) were obtained by 
'expression of the interval in two overlapping clones. The IO-330aa interval was cloned in 
both pETHisa and pMalc vectors, using the BamH\-Ajl\\\ (filled) DNA fragment from pPBlO- 
520. This fragment was cloned into BamVW-HindlU (filled) restricted pMalc or BamHl-Uindl 
restricted pETHisa vector. Recombinant clones were verified by restriction mapping. High 
level expression of either insoluble (pET) or soluble (pMal) fusion protein was obtained. Total 
yields of fusion protein from the pMB 10-330 construct (Figure IX) were 20 mg/liter culture, 
of which 10% was estimated to be full-length fusion protein. Although yields of this interval 
were higher and >W% full-length recombinant protein produced when expressed from the 
pET construct, the pMal fusion was utilized since the expressed protein was soluble and thus 
more likely to have the native conformation. 

The pMB260-520 clone was constructed by cloning KcoRUXhal cleaved amplified 
DNA from a PCR reaction containing the PI 1 (SEQ II) N():I7) and PI0 (SEQ ID NO: I ft) 
DNA primers (figure 18) into similarly restricted pMalc vector. Yields of affinity purified 
protein were 10 mg/liter. of which approximately 50% was estimated to be lull-length 
recombinant protein. 

I he aaSIO-1 1 10 interval was expressed as described below. This entire interval was 
expressed as a pMal fusion by cloning the Nhe\-fIindU\ fragment of pUCBlO-1530 into Xha\- 
flindUl cleaved pMalc vector. The integrity of the construct (pMBSK)-l 110) was verified by 
restriction mapping and DNA sequencing of the 5' clone junction using a MBP specific DNA 
primer. The yield of affinity purified protein was 25 mg/liter culture, of which 5% was 
estimated to be full-length fusion protein (1 mg/liter). 

fo attempt to obtain higher yields, this region was expressed in two fragments (aa510- 
820. and 820-1 1 10) in the pMalc vector. The piMB5 1 0-820 clone was constructed by 
insertion of a Sad (in the pMalc polylinkcr 5* to the insert )-Hpu\ DNA fragment from 
pMBSKM 1 10 into Sad/St al restricted pMalc vector. The pMB820-l 1 10 vector was 
constructed by insertion of the HpaUIIinttUl fragment of pUCBlO-1530 into JiumHl (filled)/ 
Hinc/lU cleaved pMalc vector. The integrity of these constructs were verified by restriction 
mapping and DNA sequencing of the 5' clone junction using a MBP specific DNA primer. 

Recombinant protein expressed from the pMB5l()-820 vector was highly unstable. 
However, high levels (20 mg/liter) of > l X)% full-length fusion protein were obtained from the 
pMB820-l 105 construct. The combination of partially degraded pMBSKM 1 10 protein 
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(enriched for the 510-820 interval) with the pMB820-l , 10 protein provides usable amounts of 
recombinant antigen from this interval. 

The aal 100-1750 interval was expressed as described below. The entire interval was 
expressed in .he pMalc vector from a construct in which the ,4cd(filled)-.S)*I fragment of 
pPBIO-1750 was inserted into S,u\IXba\ (Xbal is compatible with Spcl: Sml and filled Acc\ 
•sites are both blunt ended) restricted pMalc. The integrity of this construct (pMBl 100-1750) 
was verified by restriction mapping and DNA sequencing of the clone junction using a MBP 
•specie DNA primer. Although 15 mg/liter of affinity purified protein was isolated from cells 
harbonng tins construct, the protein was greater than 99% degraded ,o MBP monomer sbe 
A smaller derivative of pMBl 100-1750 was constructed bv restriction of pMBl 100- 
1750 with ,//., and *,/> (i „ the pMalc poly.inker .V to the insert,, fil.ing in the overhanging 
ends, and rehgaung the plasmid. The resultant clone (verified by restriction diuestion and 
DNA sequencing) has deleted .he aa!530-1750 interval, was designated P MBI 100-1530 
PMBl 100-1530 expressed recombinant protein a, a yield of greater than 40 me/litcr. of which 
was estimated lo be full-length fusion protein. 

Three constructs were made to express the remaining interval. Initially, a B, P H\ 
(filled)-*),,! fragment from pPB10-1750 was cloned into &»R|<filledJ/.Vfc,l cleaved pMalc 
vector. The integrity of .his construct (pMB1570-l 750) was verified by restriction mapping 
and DNA sec,uencing of the 5' Cone junction using a MBP specific DNA primer. Impression 
"f recombmam protein from this plasmid was very low. approximately 3 mg affinitv purified 
protein per h.er. and most was degraded to MBP monomer size. This reuion was 
subsequently expressed from a PCR amplified DNA fragment. A PCR reaction utilizing 
prune* P.3 |SEQ ID N():, 8: P,3 was engineered to introduce an EcoRI site 5" to amplified 
u«.n B sequencesl and P8 (SEQ «D N():.4) was performed on C. genomic DNA as 

described above. The amplified fragment was cleaved with £„,R| and S M . and Coned into 
fec»RI/.\fc,| cleaved pMalc vector. The resultant clone ( P MBI530-1 750) was verified bv 
restriction map analysis, and recombinant protein was expressed and purified. The vicld was 
greater than 20 mg protein per liter culture and it was estimated that 25% was full-lenuth 
lusmn protein: this was a significan.ly higher yield than the original pMBl S7CM7SO clone 
The msert of pMB.530-,750 (in a E,„R,-.Wl fragment, was transferred to the pETHisa 
vector tfuMXhol cleaved. M and ends are compatible,. No detectable fusion 
protem was purified on Ni-Chelate columns from soluble lysates of cel.s induced to express 
lusion protein from this construct. 
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TABLE 23 

Summary Of Toxin B Expression Constructs' 1 



10 



20 



25 



30 



C lone 


Affinity Tai: 


Yield (m»/liter) 


% l-'ull Length 


pPBIO-1750 


none 


low (estimated from Western 
blot hyb.) 


■) 


pPB! 0-1530 


none 


low (as above) 


'> 


pMB 10-470 


MBP 


15my/l 


0% 


pPB 10-320 


poly-his 


0.5m»'l 


20% 


pPBI0-3.i0 


poly-his 


•20mu/l (insoluble) 


90% 


p\l BIO- 3 30 


MBP 


20m&i 


10% 


pMB260-520 


MBP 




50% 


pMIsS lO-l 1 10 


MBP 


25m$/l 


5% 


pM 1*510-820 


MBP 


degraded (by Western blot 
hyb) 




p\tbX20-IIIU 


MBP 


20m\><I 


*M)% 


pMBl 100-1750 


MBP 


1 5mi»/l 


0% 


pMBI 100-1530 


MBP 


40m&l 


5% 


pMB1570-l750 


MBP 


3mu/l 


■ 5% 


pPB 1530- 1750 


poly-his 


no purified protein detected 


•» 


p\1BI53O~l"50 


MBP 


20m$/l 


25% 


p.Vlil '50-2360 


MBP 


>20m\*<l 




pMBp 1750-2360 


MBP 


6.5mg/l (secreted) 


50% 


pPB 1750-2360 


poly-his 


■■•20mg/l 


90% 


pMB 1750- 1970 


MBP 


■20mg'l 


■90% 


pMB 1070-2360 


MBP 


40mii/l 


90% 


pMBpl97()-2360 


MBP 


(no secretion) 


NA 


pMB 1850-2360 


MBP 


20mu/l 


90% 


pPB 1850-2360 


poly-his 


I5mg/I 


■90% 


pMB 1850- 1970 


MBP 


70m »/l 


•90% 


pPB 1850- 1970 


poly-his 


HOmg'l (insoluble) 


•90% 


pPB 1850-2070 


poly-his 


•lOmiv'l (insoluble) 


90% 


pPB 1750- 1970(f) 


poly-his 


I0m$»/1 (insoluble) 


•90% 


pPB I750-I970(n) 


poly-his 


•I0mg/l (insoluble) 


90% 



Clones in italics arc clones currently utilized to purify recombinant protein from each selected 
interval. 

35 
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occurs with the CTB antibody-recombinant mixture, that recombinant contains only weak or 
non-neutralizing epitopes of toxin B. This assay was performed as follows. 

Antibodies against CTB were generated in egg laying Leghorn hens as described in 
Example 8. The lethal dosage (LD m ) of C. difficile toxin D when delivered I.P. into 40g 
female Golden Syrian hamsters (Charles River) was determined to be 2.5 to 5 ug. Antibodies 
generated against CTB and purified by PEG precipitation could completely protect the 
hamsters a. the I.P. dosage determined above. The minimal amount of CTB antibodv needed 
to afford good protection against 5 ug of CTB when injected I.P. into hamsters was also 
determined (IX PEG prep). These experiments defined the parameters needed to test whether 
;> given recombinant protein could deplete protective CTB antibodies. 

The cloned regions tested for neutralizing ability cover the entire toxin B gene and 
were designated as Intervals (INT) I through 5 (see Figure 19). Approximately equivalent 
final concentrations of each recombinant polypeptide were tested. The following recombinant 
polypeptides were used: I ) a mixture of intervals I and 2 (INT- 1. 2); 2) a mixture of 
Intervals 4 and 5 (INT-4. 5, and 3) Interval 3 (INT-3). Recombinant proteins (each at about 
I mg total protein) were first preincubated with a final CTB antibody concentration of IX 
\U:. pellet dissolved in original yolk volume as described in Example l(c)| in a final volume 
of 5 mis for I hour a, 37°C. Twenty-five ug of CTB (at a concentration of 5 MB /ml>. enough 
CTB to kill 5 hamsters, was then added and the mixture was then incubated for I hour at 
•W. Five. 40g female hamsters (Charles River) in each treatment group were then each 
given I ml of the mixture I.P. using a tuberculin syringe with a 27 gauge needle. The results 
of this experiment are shown in "fable 24. 





TABLE 24 




Binding 


Of Neutralizing Antibodies Bv INT : 


> Protein 


Treatment Group' 


Number Of Animals Alive 


Number Of Animals Dead 


CTB antibodies 


3 


-> 


C TB antibodies - INTL2 


3 


2 


CTB antibodies * INT4.5 


3 


•*> 


CTB antibodies + INT 3 


0 


s 



C t/i//idlc loxin B (CTB) was added to each group. 



As shown in Table 24. the addition of recombinant proteins from INT- 1. 2 or INT-4 5 
had no effect on the /„ ,* w protective ability of the CTB antibody preparation compared to 
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10 



20 



2> 



EXAMPLE 19 

Identification." Purification And Induction Of Neutralizing 
Antibodies Against Recombinant C. difficile Toxin B Protein 

To determine whether recombinant toxin B polypeptide fragments can generate 
neutralizing antibodies, typically animals would first be immunized with recombinant proteins 
and anti-reeombinant antibodies arc generated. These anli-recombinant protein antibodies are 
then tested for neutralizing ability hi vivo or in vitro. Depending on the immunogenic nature 
of the recombinant polypeptide, the generation of high-tiler antibodies against that protein 
may take several months. To accelerate this process and identify which recombinant 
polypeptide! s) may he the best candidate to generate neutralizing antibodies, depletion studies 
were performed. Specifically, recombinant toxin B polypeptide were pre-screened by testinu 
whether they have the ability to bind to protective antibodies from a CTB antibody 
preparation and hence deplete those antibodies of their neutralizing capacity. Those 
recombinant polypeptides found to bind CTB, were then utilized to generate neutralizing 
antibodies. This Lxample involved: a) identification of recombinant sub-regions within toxin 
B to which neutralizing antibodies bind: b) identification of toxin B sub-region specific 
antibodies that neutralize toxin R in vivo: and c) generation and evaluation of antibodies 
reactive to recombinant toxin B polypeptides. 

a) Identification Of Recombinant Sub-Regions Within Toxin U 
To Which Neutralizing Antibodies Bind 

Sub-regions within toxin B to which neutralizing antibodies bind were identified by 
utilizing recombinant toxin B proteins to deplete protective antibodies from a polyclonal pool 
of antibodies against native ( '. difficile toxin B. An in vivo assay was developed to evaluate 
protein preparations for the ability to bind neutralizing antibodies. Recombinant proteins were 
first pre-mixed with antibodies directed against native toxin B (CTB antibody; see Hxample 8) 
and allowed to react for one hour at 37°C\ Subsequently. C. difficile toxin B (CTB: Tech 
Lab) was added at a concentration lethal to hamsters and incubated for another hour at 37°C. 
After incubation this mixture was injected intraperitoneally (IP) into hamsters. If the 
recombinant polypeptide contains neutralizing epitopes, the CTB antibodies will lose its 
ability to protect the hamsters against death from CTB. If partial or complete protection 
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the CTB antibody preparation alone. In contrast. INT-3 recombinant polypeptide was able to 
remove all of the toxin B neutralizing ability of the CTB antibodies as demonstrated by the 
death of all the hamsters in that group. 

The above experiment was repeated, using two smaller expressed fragments (pMB 
1750-1970 and pMB 1970-2360. see Figure 19) comprising the INT-3 domain to determine if 
that domain could be further subdivided into smaller neutralizing epitopes. In addition, full- 
length INT-3 polypeptide expressed as a nickel tagged protein (pPBl 750-2360) was tested for 
neutralizing ability and compared to the original INT-3 expressed MBP fusion (pMBI750- 
2360). Che results are shown in Table 25. 

TABLE 25 



Treatment Group' 


- • tw^^.m V~\JI HUH 1 1 

Number Of Animals Alive 


Number Of Animals IJead 


CTB antibodies 


5 


0 


CTB antibodies ♦■ pPB 1750-2360 


0 


5 


CTB antibodies ♦ pMB 1750-2360 


0 


5 


CTB antibodies • pMBI970-2360 


3 


■> 


CTB antibodies pMB 1750- 1970 


2 





( ' dillkilc mxin 11 (CTB) was added to each group. 



The results summarized in Table 25 indicate that the smaller polypeptide fragments 
within the INT-3 domain. pMB 1 750-1970 and pMB 1970-2360. partially lose the ability to 
hind to and remove neutralizing antibodies from the CTB antibody pool. These results 
demonstrate that the full length INT-3 polypeptide is required to completely deplete the CTB 
antibody pool of neutralizing antibodies. This experiment also shows that the neutralization 
epitope of INT-3 can be expressed in alternative vector systems and the results are 
independent of the vector utilized or the accompanying fusion partner. 

Other Interval 3 specific proteins were subsequently tested for the ability to remove 
neutralizing antibodies within the CTB antibody pool as described above. The Interval 3 
specific proteins used in these studies are summarized in Figure 23. In Figure 23 the 
following abbreviations arc used: pl> refers to the pET23 vector: pM refers to the pMALc 
vector: B refers to toxin B: the numbers refer to the amino acid interval expressed in the 
clone. The solid black ovals represent the MBP: and HUH represents the poly-histidine tag. 
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Only recombinant proteins comprising the entire toxin B repeat domain (pMB1750- 
2360. pPB 1750-2360 and pPB 1850-2360) can bind and completely remove neutralizing 
antibodies from the CTB antibody pool. Recombinant proteins comprising only a portion of 
the toxin B repeat domain were not capable of completely removing neutralizing antibodies 
5 from the CTB antibody pool (pMB 1750- 1970 and pMB 1970-2360 could partially remove 
neutralizing antibodies while pMBI850-1970 and pPB 1850-2070 failed to remove any 
neutralizing antibodies from the CTB antibody pool). 

The above results demonstrate that only the complete ligand binding domain (repeat 
region) of the toxin B gene can bind and completely remove neutralizing antibodies from the 
10 CTB antibody pool. These results demonstrate that antibodies directed against the entire toxin 
B repeat region are necessary for in vivo toxin neutralization (see Figure 23; only the 
recombinant proteins expressed by the pMBl 750-2360. pPB 1750-2360 and pPB 1 850-2360 
vectors are capable of completely removing the neutralizing antibodies from the CTB 
antibody pool). 

\> These results represent the first indication that the entire repeal region of toxin B 

would he necessary for the generation of antibodies capable of neutralizing toxin B. and that 
sub-regions may not be sufficient to generate maximal titers of neutralizing antibodies. 

h) Identification Of Toxin B Sub-Region Specific Antibodies 

20 That Neutralize Toxin B In Vivo 

lo determine if antibodies directed against the toxin B repeat region are sufficient for 
neutralization, region specific antibodies within the C I B antibody preparation were afflnitv 
purihed. and tested for in vivo neutralization. Affinity columns containing recombinant toxin 
B repeal proteins were made as described below. A separate affinity column was prepared 

25 using each of the following recombinant toxin B repeat proteins: pPB! 750-2360. pPB1850- 
2360. pi\4B1750-I970 and pMB 1 970-2360. 

for each affinity column to be made, four ml of PBS-waslied Actigel resin (Sterogene) 
was coupled overnight at room temperature with 5-10 mg of affinity purified recombinant 
protein (first extensively dialyzed into PBS) in 15 ml tubes (falcon) containing a 1/10 final 

30 volume Aid-coupling solution ( 1 M sodium cyanoborohydride). Aliquots of the supernatants 
Irom the coupling reactions, before and after coupling, were assessed by Coomassic staining 
of 7.5% SDS-fAGE gels. Based on protein band intensities, in all cases greater than 30% 
coupling efficiencies were estimated. The resins were poured into 10 ml columns (BioRad), 
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washed extensively with PBS. pre-eluted with 4M guanidine-HCI (in 10 mM Tris-HCl. pH 
8.0) and reequilibrated in PBS. The columns were stored at 4°C. 

Aliquots of a CTB IgY polyclonal antibody preparation (PEG prep) were affinity 
purified on each of the four columns as described below. The columns were hooked to a UV 
moniior (1SCO). washed with PBS and 40 ml aliquots of a 2X PEG prep (filter sterilized 
using a 0.45 u filter) were applied. The columns were washed with PBS until the baseline 
value was re-esiablished. The columns were then washed with BBStween to elute 
nonspeciilcallv binding antibodies, and reequilibrated with PBS. Bound antibodv was eluled 
from the column in 4M guanidine-HCI (in lOmM Tris-HCl. pH8.0). The eluted antibodv was 
immediately dialyzed against a 100-fold excess of PBS at 4°C for 2 hrs. The samples were 
ihen dialyzed extensively against a, leasi 2 changes of PBS. and affinity purified antibodv was 
collected and stored at 4°C. The antibody preparations were quantified by UV absorbance. 
The clution volumes were in the range of 4-8 ml. All affinity purified stocks contained 
similar total antibody concentrations, ranging from 0.25-0.35% of the total protein applied to 
ihe columns. 

The ability of the affinity purified antibody preparations to neutralize toxin B m vivo 
was determined using the assay outlined in a) above. Affinity purified antibody was diluted 
1:1 in PBS before testing. The results arc shown in Table 26. 

In all cases similar levels of toxin neutralization was observed, such that lethality was 
delayed in all groups relative to preimmune controls. This result demonstrates that antibodies 
reactive to the repeat region of the toxin 13 gene are sufficient to neutralize toxin 13 ,„ vivo. 
The hamsters will eventually die in all groups, but this death is maximally delayed with the 
CTB PEXi prep antibodies. Thus neutralization with the affinity purified (AP) antibodies is 
not as complete as that observed with the CTB prep before affinity chromatography. This 
result may be due to loss of activity during guanidine denaturaiion (during the elution of the 
antibodies from the affinity column) or the presence of antibodies specific to other regions of 
the toxin B gene that can contribute to toxin neutralization (present in the CTB PEG prep). 
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TABLE 26 

Neutralization Of Toxin B By Affinity Purified Antibodies 



Treatment group* 


Number Animals Alive'* 


Number Animals Oead h 


Preimmunc' 


0 


5 


CTB ' : 400 ng 


5 


0 


CTB (AP on pPB! 750-2360):-' 875 \i» 


5 


0 


CTB (AP on pMBI750-l970): : 875 Mg 




0 


CTB (AP on pMBI970-2360): : 500 Mg 




0 



('. diffkiie toxin B (CTB) (Tech Lab: at 5 pg/ml. 25 y\» total) at lethal concentration to 
hamsters is added to antibody and incubated Tor one hour at 37 n C. Alter incubation this 
mixture is injected intraperitoneal^ (IP) into hamsters, r.ach treatment group received toxin 
premixed with antibody raised against the indicated protein, as either: UX antibody PEG prep 
or Affinity purified (AP) antibody (from CTB PEG prep, on indicated columns). The amount 
of specific antibody in each prep is indicated: the amount is directly determined for affinity 
purified preps and is estimated for the 4X CTB as described in Example 15. 

The numbers in each group represent numbers of hamsters dead or alive. 2 hr post IP 
administration of toxin/antibody mixture. 

The observation that antibodies affinity purified against the non-overlapping 
pMB I 750-1970 and pMB 1 970-2360 proteins neutralized toxin B raised the possibility that 
either I ) antibodies specific to repeat sub-regions are sufficient to neutralize toxin B or 2) 
sub-region specific proteins can bind most or all repeat specific antibodies present in the CTB 
polyclonal pool. This would likely be due to conformational similarities between repeals, 
since homology in the primary amino acid sequences between different repeats is in the ranee 
of only 25-75% [Eichel-Streiber. c( al. (1992) Molec. Gen. Genetics 233:260). These 
possibilities were tested by affinity chromatography. 

The CTB PLKi prep was sequentially depleted 2X on the pMB 1750- 1970 column: only 
a small elution peak was observed after the second chromatography, indicating that most 
reactive antibodies were removed. This interval depleted CTB preparation was then 
chromatographed on the pPBI 850-2360 column: no antibody bound to the column. The 
reactivity of the CTB and CTB (pMBl 750-1970 depleted) preps to pPBI 750-2360. pPB1850- 
2360. pMB1750-1970 and pMBl 970-2360 proteins was then determined by l-LISA using the 
protocol described in Example 13(c). Briefly. 96-well microliter plates (Falcon, Pro-Bind 
Assay Plates) were coated with recombinant protein by adding 100 ^1 volumes of protein at 
1-2 jig/ml in PBS containing 0.005% thimcrosal to each well and incubating overnight at 4°C. 
Ihc next morning, the coating suspensions were decanted and the wells were washed three 
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times using PBS. In order to block non-specific binding sites. 100 ul of 1.0% BSA (Sigma) 
in PBS (blocking solution) was then added to each well, and the plates were incubated for I 
nr. at 37°C. The blocking solution was decanted and duplicate samples of 150 ul of diluted 
antibody was added to the first well of a dilution series. The initial testing serum dilution 
was ( 1/200 for CTB prep, (the concentration of depleted CTB was standardized by OD,.,) in 
Nocking solution containing 0.5% Tvveen 20. followed by 5-fold serial dilutions into this' 
solution. This was accomplished by serially transferring 30 ul aliquots to 120 ul buffer, 
mixing, and repeating the dilution into a fresh well. Alter the final dilution. 30 ul was 
remov ed from the well such that all wells contained 120 M | final volume. A total of 5 such 
dilutions were performed (4 wells total). The plates were incubated for I hr at 37°C. 
Following this incubation, the serially diluted samples were decanted and the wells were 
washed three times using PBS containing 0.5% Tween 20 (PBST). followed by two 5 min 
washes using BBS-Tween and a final three washes using PBST. To each well. 100 ul of 
I 1000 diluted secondary antibody [rabbit anli-chickcn IgCi alkaline phosphatase (Sigma) 
diluted in blocking solution containing 0.5% Tween 20] was added, and the plate was 
mcubated 1 hr al 37°C. The conjugate solutions were decanted and the plates were washed 6 
times in PBST. then once in 50 mM Na : C0 5 . 10 mM MgCk pH 9.5. The plates were 
developed by the addition of 100 ul of a .solution containing I mg/ml para-nitro phenyl 
Phosphate (Sigma) dissolved in 50 mM Na,CO,. 10 mM MgCI... P H9.5 to each well. The 
plates were then incubated at room temperature in the dark for 5-45 min. The absorbency of 
each well was measured al 410 nm using a Dynatech MR 700 plate reader. 

As predicted by the affinity chromatography results, depletion of the CTB prep on the 
PMBI75O-IQ70 column removed all detectable reactivity to the pMB 1970-2360 protein. The 
reciprocal purification of a CTB prep that was depleted on the pMBl 970-2360 column 
yielded no bound antibody when ehromatographed on the pMB 1 750-1970 column. These 
results demonstrate that all repeat reactive antibodies in the CTB polyclonal pool recognize a 
conserved structure that is present in non-overlapping repeats. Although it is possible that this 
conserved structure represents rare conserved linear epitopes, it appears more likely that the 
neutralizing antibodies recognize a specific protein conformation. This conclusion was also 
suggested by the results of Western blot hybridization analysis of CTB reactivity to these 
recombinant proteins. 

Western blots of 7.5% SDS-PAGC gels, loaded and electrophoresed with defined 
quantities of each recombinant protein, were probed with the CTB polyclonal amibodv 
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preparation. The blots were prepared and developed with alkaline phosphatase as described in 
Example 3. The results are shown in Figure 24. 

Figure 24 depicts a comparison of immunorcactivity of IgY antibody raised against 
either native or recombinant toxin B antigen. Equal amounts of pMB 1750- 1070 (lane 1), 
5 pMB 1970-2360 (lane 2). pPBI 850-2360 (lane 3) as well as a serial dilution of pPB 1750-2360 
(lanes 4-6 comprising IX. I/10X and I/100X amounts, respectively) proteins were loaded in 
duplicate and resolved on a 7.5% SDS-PACil: gel. The gel was blotted and each half was 
hybridized with PEG prep IgY antibodies from chickens immunized with eilher native CTB or 
pPBI 750-2360 protein. Note that' the full-length pMBl 750- 1970 protein was identified only 

10 by antibodies reactive to the recombinant protein (arrows). 

Although the CTB prep reacts with the pPB 1 750-2360. pPB 1850-2360. and pMB1970- 
2360 proteins, no reactivity to the pMB1750-1970 protein was observed (Figure 24). Given 
that ail repeal reactive antibodies can be bound by this protein during afflnitv 
chromatography, this result indicates that the protein cannot fold properly on Western blots. 

1 3 Since this eliminates all antibody reactivity, it is unlikely that the repeat reactive antibodies in 
the CTB prep recognize linear epitopes. This may indicate that in order to induce protective 
antibodies, recombinant toxin B protein will need to be properly folded. 

c) Generation And Evaluation Of Antibodies Reactive To 
-° Recombinant Toxin B Polypeptides 

i) Generation Of Antibodies Reactive To 
Recombinant Toxin B Proteins 

Antibodies against recombinant proteins were generated in egg laying Leghorn hens as 
described in l:\ample 13. Antibodies were raised (using Preunds adjuvant (Gibco) unless 
2> otherwise indicated] against the following recombinant proteins: I } a mixture of Interval 1+2 
proteins (see Figure 18); 2) a mixture of interval 4 and 5 proteins (see Figure 18); 3) 
PMB1970-2360 protein; 4) pPB 1750-2360 protein; 5) pMB! 750-2360: 6) pMBl 750-2360 
ITitermax adjuvant (Vaxcell)]: 7) pMBI 750-2360 |Gerbu adjuvant (Biotech)]: 8) P MBpl750- 
2360 protein: 9) pPB 1 850-2360: and 10) pMBI 850-2360. 

Chickens were boosted at least 3 times with recombinant protein until F.LISA 
reactivity (using the protocol described in b) above with the exception that the plates were 
coated with pPB 1 750-2360 proieinl of polyclonal PEG preps was at least equal to that of the 
CTB polyclonal antibody PEG prep. ELISA titers were determined for the PRG preps from 
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all of the above immunogens and were found to be comparable ranging from 1:12500 to 
1:62500. High titers were achieved in all cases except in 6) pMB 1750-2360 in which strong 
titers were not observed using the Titermax adjuvant, and this preparation was not tested 
further. 



i") Evaluation Of Antibodies Reactive To 
Recombinant Proteins By Western Blot 
Hybridization 

Western blots of 7.5% SDS-PACit gels, loaded and electrophorcscd with defined 
quantities of recombinant protein ( P MB1750-I970. pPB 1 850-2360. and pMB 1970-2360 
proteins and a serial dilution of the pPB 1 750-2360 to allow quantification of reactivity), were 
probed with the CTB. pPB 1 750-2360. pMBI 750-2360 and pMB 1 970-2360 polyclonal 
antibody preparations ( from chickens immunized using Frcunds adjuvant). The blots were 
prepared and developed with alkaline phosphatase as described above in b). 

As shown in figure 24. the CTB and pMB 1970-2360 preps reacted strongly with the 
pPIJI 750-2360. pPB 1 850-2.160. and pMB 1970-2360 proteins while the pPB1750-2360 and 
pMB 1 970-2360 (Gerbu) preparations reacted strongly with all four proteins. The Western 
blot reactivity of the pPB 1 750-2360 and pMB 1 970-2360 (Gerbu, preparations were equivalent 
to (hat of the CTB preparation, while reactivity of the pMB 1970-2360 preparation was <I0% 
that of the C TB prep. Despite equivalent liLISA reactivities only weak reactivity 
(approximately 1%) to the recombinant proteins were observed in PEG preps from two 
independent groups immunized with the pMB 1 750-2360 protein and one group immunized 
with the pMB 1750-2360 preparation using Freunds adjuvant. 

Affinity purification was utilized to determine if this difference in immunoreactivity by 
Western bio, analysis reflects differing antibody titers. Fifty ml 2X PEG preparations from 
chickens immunized with either pMB 1 750-2360 or pMB 1970-2360 protein were 
chromatography on the pPB 1 750-2360 affinity column from b) above, as described. The 
yield of affinity purified antibody (% total protein in preparation) was equivalent to the yield 
obtained from a CTB PF.Ci preparation in b) above. Thus, differences in Western reactivity 
reflect a qualitative difference in the antibody pools, rather than quantitative differences.. 
These results demonstrate that certain recombinant proteins are more effective at generating 
high affinity antibodies (as assayed by Western blot hybridization). 
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iii) In Vivo Neutralization Of Toxin B Using 

Antibodies Reactive To Recombinant Protein 

The in vivo hamster model [described in Examples 9 and 14(b)] was utilized to assess 
the neutralizing ability of antibodies raised against recombinant toxin B proteins. The results 
Irom three experiments are shown below in Tables 27-29. 

The ability of each immunogen to neutralize toxin B in vivo has been compiled and is 
shown in Table 30. As predicted from the recombinant protein-CTB premix studies (Table 
24) only antibodies to Interval 3 (1750-2366) and not the other regions of toxin B {i.e.. 
intervals 1-5) are protective. Unexpectedly, antibodies generated to INT-3 region expressed 
pMAI. vector (pMB 1 750-2360 and pMpB] 750-2360) using Freunds adjuvant were non- 
neutralizing. This observation is reproducible, since no neutralization was observed in two 
independent immunizations with pMB 1 750-2360 and one immunization with pMpBI750- 
2360. The fact that 5X quantities of affinity purified toxin B repeal specific antibodies from 
pMB 1750-2360 PIXi preps cannot neutralize toxin B while IX quantities of affinity purified 
anti-CTB antibodies can (Table 28) demonstrates that the differential ability of C I'B 
antibodies to neutralize toxin B is due to qualitative rather than quantitative differences in 
these antibody preparations. Only when this region was expressed in an alternative vector 
(pPB 1 750-2360) or using an alternative adjuvant with the pMB I 750-2360 protein were 
neutralizing antibodies generated. Importantly, antibodies raised using Freunds adjuvant to 
pPB 1 850-2360. which contains a fragment that is only 100 amino acids smaller than 
recombinant pPB I 750-2360. are unable to neutralize toxin B in vivo (Table 27): note also that 
the same vector is used for both pPB 1850-2360 and pPB 1 750-2360. 
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TABLE 27 



Treatment Group" 


Number Animals Alive 1 ' 


.Mil n 

Number Animals Dead h 


Preimmune 


0 


5 


CTB 


e 


0 


INTI+2 


0 


5 


INT 4*5 


0 


5 


pMB 1750-2360 


0 


5 


pMB 1970-2360 


0 


5 


pPB 1750-2360 


3 


0 



H.e numbers in each group represent numbers of hamsters dead or alive - |, 0 „rs 
iuiministrat.on of toxin/antibody mixture. " 



post IP 



TABLE 28 



Treatment Group" 


. ^ ^3,iri: /ifnniiy nirinet 
Number Animals Alive" 


J Antibodies 

Number Animals Dead 1 ' 


!Veimmune< 1) 


0 


5 


CTBO) 


5 


0 


pPB 1750-2360(1) 


5 


0 


M niL' ami-pMm750-2360P* 


1 




i ma ami-pMB1970-2360("») 


0 


4 

5 


300 ami-CTB(2) 


5 


0 



a,Xs' IIS ? 5 7' ml: ■ 25 ^ w " l:1 ** « >«■»< concentration to 

luiniMus c added to antibodv and incubated for one hnur it ii°r aa- • ■ 

=" UP ,an,, - C TB ' rc P cnI *PCCi»ic used 1/5 diluted anti-C.TB antibody). " 

The numbers in each group represent numbers of hamsters dead or alive ■> l, r nost IP 
admmistraiion of toxin/antibody mixture. " P 
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TABLE 29 



Generation Of Neutral izin«j Antibodies Utilizing The Gcrbu Adjuvant 



Treatment Group'' 


Number Animals Alive' 1 


Number Animals Dead' 1 


Preimmune 


0 


5 


CTB 


5 


0 


pMB 1970-2360 


0 


5 


pMB 1850-2360 


I) 


5 


pPB 1850-2360 


0 


5 


pMB 1750-2360 (Gcrbu adj) 


5 


0 



( difficik' toxin B (CTB) (Tech Lab) at lethal concentration to hamsters is added to aniibodx 
and incubated Tor one hour at 37°C. After incubation this mixture is injected intraperitoneal I y 
(IP) into hamsters. Lach treatment j:roup received toxin prenmed with antibody raised auninsi 
the indicated protein, as a 4X antibody PEG prep. 

The numbers in each group represent numbers of hamsters dead or alive. 2hts post IP 
administration of toxin antibody mixture. 
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Immunouen 



TABLE 30 

in Vivo Neutralization Of Toxin B 



Adjuvant 



Preimmunc 



NA* 



Tested Preparation 



PEG 



Antigen 
Utilized For AP 



NA 



In vivo 
Neutralization 1, 



CTB (native) 



Tttermax 



PEG 



NA 



CTB {native) 



Thermnx 



AP 



pPB 1 750-2360 



CTB (native) 



Tilermax 



AP 



pPB 1X50-2360 



CTB (native) 



Titermax 



AP 



pPB1750-|970 



CTB (naiive) 



Titermax 



AP 



pPB 1070-2360 



pMB 1750-2360 



pMB 1 750-2360 



pMB I 750-2360 



pMBI l )7()-236() 



pMB 1970-2360 
pPB 1750-2360 



pPB 1850-2360 



Freunds 



Freunds 



Gerbu 



Freunds 



Freunds 
Freunds 



Freunds 



PEG 



NA 



AP 



pPB 1750-2360 



PEG 



NA 



PEG 



NA 



AP 
PEG 



pPB 1750- 2360 
NA 



PEG 



NA 



no 
ves 



pMB 1X50-2360 



INT I 
INI 4-5 



Freunds 



PEG 



Freunds 
Freunds 



PEG 
PEG 



NA 



no 



NA 
NA 



no 
no 



liiihcr I'KG preparation (PRC.) or affinity purified antibodies ( AP). 

<»l lo.Mii ». 2 hours post-administration of mixture. ' 
NA' denotes not applicable. 

The pPB 1750-2360 antibody pool confers significant /„ vm, protection, equivalent to 
•hat obtained with .he affinity purified CTB antibodies. This correlates with the observed high 
aMmiiy of .his antibody pool (relative lo .he pMB 1750-2360 or pMB1970-236() pools) as 
assayed by Western blot analysis (Figure 24). These results provide .he firs, demonstration 
«"« neutralizing antibodies can be induced using recombinant toxin 13 protein as 

immunoyen. 

I ho failure of high concentrations of antibodies raised against .he pMB 1750-2160 
protein (using JYcunds adjuvant) to neutralize, while the use of Cierbu adjuvant and 
pMB 1750-2360 protein generates a neutralizing response, demonstrates that conformation or 
presentauon of this protein is essential for the induction of neutralizing antibodies. These 



WO 98/08540 



PCT/US97/15394 



results are consistent with the observation that the neutralizing antibodies produced when 
native CTB is used as an immunogen appear to recognize conformational epitopes [see section 
b) above]. This is the first demonstration that the conformation or presentation of 
recombinant toxin B protein is essential to generate high titers of neutralizing antibodies. 

5 

EXAMPLE 20 

Determination Of Quantitative And Qualitative 
Differences Between pMB 1750-2360. pMB 1750-2360 (Gerbu) 
Or pPB 1750-2360 IgY Polyclonal Antibody Preparations 

10 

In Lxample 19. it was demonstrated that toxin B neutralizing antibodies could be 
generated using specific recombinant toxin B proteins (pPBl 750-2360) or specific adjuvants. 
Antibodies raised against pMB I 750-2360 were capable of neutralizing the enteroloxin effect 
ol toxin M when the recombinant protein was used to immunize hens in conjunction with the 

b (icrbu adjuvant, but not when Freunds adjuvant was used. To determine the basis for these 
antigen and adjuvant restrictions, toxin B-spcciik antibodies present in the neutralizing and 
non-neuiralizing PEG preparations were isolated by affinity chromatography and tested for 
qualitative or quantitative differences. The example involved a) purification of anti-toxin B 
specific antibodies from pMB 1750-2360 and pPB 1750-2360 PFG preparations and b) in vivo 

20 neutralization of toxin B using the affinity purified antibody. 

a) Purification Of specific Antibodies From pMBI 750-23611 And 
pPBl 750-2360 PEC Preparations 

To purify and determine the concentration of specific antibodies (expressed as the 
25 percent of total antibody) within the pPBl 750-2360 (Freunds and Gerbu) and pPB 1750-2360 
PFG preparations, defined quantities of these antibody preparations were chromatographed on 
an affinity column containing the entire toxin B repeat region (pPBl 750-2360). The amount 
of affinity purified antibody was then quantified. 

An affinity column containing the recombinant toxin B repeat protein. pIMil 750-2360. 
30 was made as follows. Four mi of PBS-washed Acligel resin (Sterogene) was coupled with 5 
mg of pPB 1750-2360 affinity purified protein (diaiyzed into PBS: estimated to be greater than 
05% full length fusion protein) in a 15 ml tube (Falcon) containing 1/10 final volume Aid- 
coupling solution (IM sodium cyanoborohydride). Aliquots of the supernatant from the 
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coupling reactions, before and after coupling, were assessed by Coomassie staining of 7.5% 
SDS-PAGE gels. Based on protein band intensities, greater than 95% (approximately 5 mg) 
of recombinant protein was coupled to the resin. The coupled resin was poured into a 10 ml 
column (BioRad). washed extensively with PBS. pre-clutcd with 4M guanidine-HCI (in 10 
mM Tris-IICi. pi I 8.0: 0.005% thimerosal) and re-equilibrated in PBS and stored at 4°C. 

Aliquots of pMB 1750-2360. pMB 1750-2360 (Gerbu) or P PBI750-2360 IgY polyclonal 
antibody preparations (PEG preps) were affinity purified on the above column as follows. 
The column was attached to an I'V monitor (ISCO). and washed with PBS. forty ml aliquots 
of 2X Pl-G preps (filter sterilized using a 0.45 u (liter and quantified by OD, sn before 
chromatography) was applied. The column was washed with PBS until the baseline was re- 
established (the column flow-through was saved), washed with BBSTwcen to elute 
nonspccifically binding antibodies and re-equilibrated with PBS. Bound antibody was eluted 
from the column in 4M guanidine-UCI (in 10 mM Tris-HCL. pH 8.0. 0.005% thimerosal) and 
the entire elution peak collected in a 15 ml lube (Falcon). The column was re-equilibrated, 
and the column eluate rc-chromatographed as described above. The antibody preparations 
were quantified by I IV absorbance (the elution buffer was used to zero the 
spectrophotometer). Approximately 10 fold higher concentrations of total purified antibody 
was obtained upon elution of the first chromatography pass relative to the second pass. The 
low yield from the second chromatography pass indicated that most of the specific antibodies 
were removed by the first round of chromatography. 

Pools of affinity purified specific antibodies were prepared by dialysis of the column 
elutcs after the first column chromatography pass for the pMB 1 750-2360. pMB 1 750-2360 
(Gerbu) or P PB 1750-2360 IgY polyclonal antibody preparations. The elutcs were collected 
on ice and immediately dialyzed against a 100-fold volume of PBS at 4°C for 2 his. The 
samples were then dialyzed against 3 changes of a 65-fold volume of PBS at 4°C. Dialysis 
was performed for a minimum of 8 hrs per change of PBS. The dialyzed samples were 
collected, cemrifuged to remove insoluble debris, quantified by OD, S0 . and stored at 4°C\ 

The percentage of toxin B repeat-specific antibodies present in each preparation was 
determined using the quantifications of antibody yields from the first column pass (amount of 
specific antibody recovered after first pass/total protein loaded). The yield of repeat-specific 
affinity purified antibody (expressed as the percent of total protein in the preparation) in: 1 ) 
the pMB 1 750,2360 Pl-G prep was approximately 0.5%. 2) the pMB 1 750-2360 (Gerbu) prep 
was approximately 2.3%. and 3) the pPBI 750-2360 prep was approximately 0.4%. 
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Purification of a CTB IgY polyclonal antibody preparation on the same column demonstrated 
that the concentration of toxin B repeal specific antibodies in the CTB preparation was 0.35%. 

These results demonstrate that I) the use of Cierbu adjuvant enhanced the titer of 
specific antibody produced against the pMBl 750-2360 protein 5-fold relative to immunization 
using Freunds adjuvant, and 2) the differences seen in the in vivo neutralization ability of the 
pMBI 750-2360 (run neutralizing) and pPBI 750-2360 (neutralizing) and CTB (neutralizing) 
PLG preps seen in Example l l ) was not due to differences in the tilers of repeat-specific 
antibodies in the three preparations because the titer of repeat-specific antibody was similar 
for all three preps: therefore the differing ability of the three antibody preparations to 
neutralize toxin B must reflect qualitative differences in the induced toxin B repeat-specific 
antibodies. To confirm that qualitative differences exist between antibodies raised in hens 
immunized with different recombinant proteins and/or different adjuvants, the same amount of 
affinity purified anti-toxin B repeat (aa 1870-2360 of toxin B) antibodies from the different 
preparations was administered to hamsters using the in vivo hamster model as described 



below. 



I>) In vivo Neutralization Of Toxin B Using Affinity Purified 
Antibody 

fhe in vivo hamster model was utilized to assess the neutralizing ability of the affinity 
purified antibodies raised against recombinant toxin B proteins purified in (a) above. As well, 
a 4X IgY PEG preparation from a second independent immunization utilizing the pl>B1750- 
2360 antigen with Freunds adjuvant was tested for in vivo neutralization. The results are 
shown in fable 31. 

The results shown in fable 31 demonstrate that: 
I ) as shown in Example \9 and reproduced here. 1.5 mg of affinity purified 

antibody from pMBl 750-2360 immunized hens using Freunds adjuvant does 
not neutralize toxin B in vivo. However. 300 ug of affmii> purified antibody 
from similarly immunized hens utilizing Gerbu adjuvant demonstrated complete 
neutralization of toxin B ;;; vim. This demonstrates that (ierbu adjuvant, in 
addition to enhancing the titer of antibodies reactive to the pMBl 750-2360 
antigen relative to Freunds adjuvant (demonstrated in (a) above), also enhances 
the yield of neutralizing antibodies to this antigen, greater than 5 fold. 
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I) Complete in vivo neutralization of toxin B was observed with 1.5 m g of 

affinity purified antibody from hens immunized with pPBl 750-2360 antigen, 
but not with pMBI 750-2360 antigen, when Frcunds adjuvant was used. This 
demonstrates, using standardized toxin B repeat-speciiic antibody 
concentrations, that neutralizing antibodies were induced when pPBl 750-2360 
but not pMB 1750-2360 was used as the antigen with Frcunds adjuvant. 
) Complete in vivo neutralization was observed with 300 ug of pMB 1 750-2360 
(Gerbu) antibody, but not with 300 ug of pPB 1750-2360 (Frcunds) antibody. 
Thus the pMB175()-2360 (Cierbu) antibody has a higher liter of neutralizing 
antibodies than the pPBl 750-2360 (Frcunds) antibody. 
) Complete neutralization of toxin B was observed using 300 ug of CTB 

antibody |affinity purified (AP)] but not 100 M g CTB antibody (AP or PRC 
prep) This demonstrates that greater than 100 M g of toxin B repeat-specific 
antibody (anti-CTB) is necessary to neutralize 25 ug toxin B in viva in this 
assay, and that affinity purilled antibodies specific to the toxin B repeat interval 
neutralize toxin B as effectively as the PFP prep of IgY raised against the 
entire CTB protein (shown in this assay). 

As was observed with the initial pPB 1 750-2360 (IgY) PliG preparation 
l Example 19). complete neutralization was observed with a IgY PFG 
preparation isolated from a second independent group of pPB 1750-2360 
f Frcunds) immunized hens. This demonstrates that neutralizing antibodies are 
rcpmducibly produced when hens are immunized with pPB 1 750-2360 protein 
utilizing Freunds adjuvant. 
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TABLE 31 

//; vivo Neutralization Of Toxin B Using Affinity Purified Antibodies 



Treatment Group' 1 


Number Animals Alive'* 


Number Animals Dead" 


Pre immune 1 


0 


5 


CTB (300 ng) : 


5 


0 


CTB f 1 00 uy) : 


1 


4 


pMU 1750-2360 (G) (5 mg) : 


5 


0 


pMB 1750-2360 (Ci) (1.5 mg) : 


5 


0 


pMB 1750-2360 fG) (300 u«) : 


5 


0 


pMB 1750-2360 (F) (1.5 m») : 


0 


5 


pPB 1750-2360 (T) (1.5 mg) : 


5 


0 


pPB 1750-2360 <F) (300 ugr 


1 


4 


CTB (100 n»>- 


-» 




pPB 1750-2360 (F) (500 uy) 


5 


0 



f ' difficile toxin B (CTB) (Tech Lab) at lethal concentration to hamsters (25 n») was added 
the antibody (amount of specific antibody is indicated) and incubated for one hour at 37°C. 
Alter incubation, [his mixture was injected IP into hamsters ( I '5 total mix injected per 
hamster). Rach treatment group received toxin premixed with amihocK raised auatnsi the 
indicated protein (G uerbti adjuvant. I'-Freunds adjuvant). ! indicates the antibody w; ls a 4X 
IgY Pl-Ci prep: ' indicates the antibody was affinity purified on ;i pPB 1 850-236O resin: and 
indicates thm the antibody was a IX IgY Pl-G prep. 

The numbers in each uroup represent numbers of hamsters dead or alive. 2 hrs post IP 
administration of tax in- ami body mixture. 



KXAMPLE 21 

Diagnostic Knzyme Immunoassays l or ( * difficile Toxins A And 13 

I he ability of the recombinant toxin proteins and antibodies raised against these 
recombinant proteins (described in the above examples) to form the basis of diagnostic assays 
lor the detection of clostridial toxin in a sample was examined. Two immunoassay formats 
were tested to quantitatively detect ( '. difficile toxin A and toxin H from a biological 
specimen. The first formal involved a competitive assay in which a fixed amount of 
recombinant toxin A or B was immobilized on a solid support U'.», microliter plate wells) 
followed hy the addition of a toxin-containing biological specimen mixed with affinity- 
purified or PHG fractionated antibodies against recombinant toxin A or B. If loxin is present 
m a specimen, this loxin will compete with the immobilized recombinant loxin protein for 
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binding to the anti-recombinant antibody thereby reducing the signal obtained following the 
addition of a reporter reagent. The reporter reagent detects the presence of antibody bound to 
.he immobilized toxin protein. 

In the second format, a sandwich immunoassay was developed using affinity-purified 
antibodies to recombinant toxin A and B. The affinity-purified antibodies to recombinant 
toxin A and B were used to coat microliter wells instead of the recombinant polypeptides (as 
was dene in the competitive assay format). Biological samples containinu toxin A or B were 
then added to the wells followed by the addition of a reporter reagent to detect the presence 
of bound toxin in the well. 



a) Competitive Immunoassay For The Detection Of C difficile 



Toxin 



Recombinant toxin A or B was attached to a solid support by coating % well 
micro.i.er plates with the toxin protein at a concentration of l M g, m | j n |>BS. The plates were 
incubated overnight at 2-8°C. The following morning, the coating solutions were removed 
and the remaining protein binding sites on the wells were blocked by fi|| lnu each well with a 
I'BS solution containing 0.5% BSA and 0.05% Tween-20. Native C. difficile toxin A or B 
( lech l„b, was diluted to 4 M g/ m | in stool extracts from healthy Syrian hamsters (Sasco). 
The stool extracts were made by placing fecal pellets in a 15 ml centrifuge tube: PBS was 
added at 2 ml/pellet and the lube was vortexed to create a uniform suspension. The tt.be was 
then ccntrifuged at 2000 rpm for 5 min at room temperature. The supernatant was removed- 
•Ins comprises the stool extract. Fifty ui of the hamster stool extract was pipetted into each 
well of the microliter plates to serve as the diluent for serial dilutions of the 4 uu /ml toxin 
samples. One hundred ul of the toxin samples at 4 H g/ m | was pipetted into the firs, row of 
wells in the microliter plate, and 50 M | aliquots were removed and diluted serially down the 
plate in duplicate. An equal volume or affinity purified anti-recombinant toxin antibodies 1 1 
ng/well of anii-pMA 1 870-2680 antibody was used for the detection of toxin A: 0 5 ng/well of 
»nii-pMBI750-2360«icrht„ was used for the detection of toxin R| were added ,o appropriate 
wells, and .he plates were incubated at room temperature for 2 hours with gentle agitation. 
Wells serving as negative control contained antibody but no native toxin to compete for 
binding. 

Unbound toxin and antibody were removed by washing the plates 3 to 5 times with 
PBS containing 0.05% Tween-20. Following the wash step. .00 M | of rabbit anti-chicken igG 
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antibody conjugated to alkaline phosphatase (Sigma) was added to each well and the plates 
were incubated for 2 hours at room temperature. The plates were then washed as before to 
remove unbound secondary antibody. Freshly prepared alkaline phosphatase substrate ( I 
mg/ml p-nitrophenyl phosphate (Sigma) in 50 mM Na.CO.. pH 9.5: ]() m M MgCU was 
added to each well. Once sufficient color developed, the plates were read on a Oynatech 
MR700 microliter plate reader using a 4)0 nm filler. 

The results are summarized in Tables 32 and 33. I- or the results shown in Table 32. 
the wells were coated with recombinant toxin A protein (pMA 1 870-2680). The amount of 
native toxin A added (present as an addition to solubilized hamster stool) to a given well is 
indicated (0 to 200 ng). Antibody raised against the recombinant toxin A protein. pMAI870- 
2680. was affinity purified on the an affinity column containing pPA 1870-2680 (described in 
Example 20). As shown in Table 32. the recombinant toxin A protein and affinity-purified 
antitoxin can be used for the basis of a competitive immunoassay for the detection of toxin A 
in biological samples. 

Similar results were obtained using the recombinant toxin B. pPB 1 750-2360. and 
antibodies raised against pMBl750-2360((icrbu). for the results shown in Table 33. the 
wells were coated with recombinant toxin B protein (pI'Bl 750-2360). The amount of native 
toxin B added (present as an addition to solubilized hamster stool) to a given well is indicated 
(0 to 200 ng). Antibody raised against the recombinant toxin B protein. pMUI750- 
2360((ierbu). was affinity purified on the an affinity column containing pl J B 1 850-2360 
(described in lixample 20). As shown in Table 33. the recombinant toxin B protein and 
affinii> -purified antitoxin can be used for the basis of a competitive immunoassay for the 
detection of toxin B in biological samples. 

In this competition assay, the reduction is considered significant over the background 
levels at all points: therefore the assay can be used to detect samples containing less than 12.5 
ng toxin A/well and as little as 50-100 ng toxin B/well. 
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TABLE 32 



n» Toxin A/Well 


OD lt „ Readout 


200 


0.(76 


100 


0.253 


50 


0.240 


25 


0.259 


12.5 


0.309 


0.25 


0.367 


3.125 


0.4 1 7 


0 


0.590 


TABLE 33 

("ompethive Inhibition Of Ami-C Jiffid/c Toxin B Bv Native Toxin H 


nii Toxin B/Well 


OD n „ Readout 


200 


0.392 


100 


0.566 


50 


0.607 


25 


0.778 


12.5 


0.970 


6.25 


0.902 


.VI 25 


1.040 


0 


1.055 



I hese competitive inhibition assays demonstrate that native C. difficile toxins and 
recombinant ( '. difficile toxin proteins can compete for binding to antibodies raised against 
recombinant ( '. difficile toxins demonstrating that these ami-recombinant toxin antibodies 
provide effective diagnostic reagents. 



») Sandwich Immunoassay For The Detection Of C. difficile 
Toxin 

Affinity-purified antibodies against recombinant toxin A or toxin li were immobilized 
to % well microliter plates as follows. The wells were passively coated overnight at 4°C 
with affinity purified antibodies raised against either pMA 1 870-26X0 (toxin A) or pMM750- 
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2360(Gerbu> (toxin B). The antibodies were affinity purified as described in Example 20. 
The antibodies were used at a concentration of I ug/ml and 100 ul was added to each 
microliter well. The wells were then blocked with 200 ul of 0.5% BSA in PBS for 2 hours 
a. room temperature and the blocking solution was then decanted. Stool samples from healthy 
Syrian hamsters were resuspended in PBS, pi I 7.4 (2 ml PBS/slool pellet was used to 
resuspend the pellets and the sample was centrifuged as described above). The stool 
suspension was then spiked with native C. difficile toxin A or B (Tech Lab) at 4 M g/ml. The 
stool suspensions containing toxin (either toxin A or toxin B) were then serially diluted two- 
fold in stool suspension without toxin and 50 ul was added in duplicate to the coated 
microliter wells. Wells containing stool suspension without toxin served as the negative 
control. 

The plates were incubated for 2 hours at room temperature and then were washed 
three times with PBS. One hundred ul of either goat umi-notivc toxin A or goal ami-native 
toxin B (Tech l.ab) diluted 1:1000 in PBS containing 1% BSA and 0.05% Twecn 20 was 
added to each well . The plates were incubated for another 2 hours at room temperature. 
The plates were then washed as before and 100 ul of alkaline phosphatase-conjugated rabbit 
anti-goat lg(i (C appel. Durham. N.C.) was added at a dilution of 1:1000. The plates were 
incubated for another 2 hours at room temperature. The plates were washed as before then 
developed by the addition of 100 ul/well of a substrate solution containing I mg/ml p . 
nitrophc.n l phosphate (Sigma) in 50 mM NaXO, pi I 9.5: 10 mM MgCL The absorbancc of 
each well was measured using a plate reader (Dynatceh) at 410 nm. The assay results are 
shown in Tables 34 and 35. 

TABLE U 



ng Toxin A/Well 


OD 4 ,„ Readout 


200 


0.9 


100 


0,8 


50 


0.73 


25 






0.71 


12.5 






iiy> 


6.25 


<M21 


0 


0 



- 140 - 



WO 98/08540 



PCT/US97/15394 



TABLE 35 



nu Toxin B/Well 


- "miwwuiva /a^uimm io\in n 

OD n „ Readout 


200 


1.2 


100 


0.973 


50 


0.887 


25 


0.846 


12.5 


0.65 1 


6.25 


0.431 


0 


0.004 



The results shown in Tables 34 and 35 show that antibodies raised against recombinant 
l»xin A and toxin 13 fragments can be used to detect the presence ulT. difficile toxin in stool 
samples. These antibodies form the basis lor a sensitive sandwich immunoassay which is 
capable of detecting as little as 6.25 ng of either toxin A or 13 in a 50 M l stool sample. As 
shown above in Tables 34 and 35. the background for this sandwich immunoassay is 
extremely low: therefore, the sensitivity of this assay is much lower than 6.25 ng toxin/well. 
It is likely that toxin levels of 0.5 to 1.0 pg/well could be detected by this assay. 

The results shown above in Tables 32-35 demonstrate clear utility of the recombinant 
reagents in f. difficile toxin detection systems. 

EXAMPLE 22 

Construction And Expression Of C. hmulinum C Fragment Fusion Proteins 

The C hmulinum type A neurotoxin gene has been cloned and sequenced [Thompson. 
a «/.. liur. .1. Biochem. 189:73 <!<W<»|. The nucleotide sequence of the toxin gene is 
available from the EMBL/GenBank sequence data banks under the accession number X52066: 
the nucleotide sequence of the coding region is listed in SEQ ID NO:27. The amino acid 
sequence of the C. hotulinum type A neurotoxin is listed in SEQ ID NO:2X. The type A 
neurotoxin gene is synthesized as a single polypeptide chain which is processed to form a 
climer composed of a light and a heavy chain linked via disulfide bonds. The 50 kD carboxy- 
terminal portion of the heavy chain is referred to as the C fragment or the 1 1, domain. 

Previous attempts by others to express polypeptides comprising the C fragment of C. 
homlinum type A toxin as a native polypeptide [e.g.. not as a fusion protein) in £ coli have 
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been unsuccessful [H.F. LaPenotiere. et ai in Botulinum and Tetanus Neurotoxins. DasGupta. 
Ed.. Plenum Press. New York (1993). pp. 463-466]. Expression of the C fragment as a 
fusion with the E. coii MBP was reported to result in the production of insoluble protein 
(H.F. LaPenotiere. et ai. supra). 
^ In order to produce soluble recombinant C fragment proteins in £. coii. fusion proteins 

comprising a synthetic C fragment gene derived from the C hotulinum type A toxin and 
either a portion of the C. difficile toxin protein or the MBP were constructed. This example 
involved a) the construction of plasmids encoding C fragment fusion proteins and b) 
expression off. hotulinum C fragment fusion proteins in coii, 

10 

ii) Construction Of Plasmids Encoding C Fragment Fusion 
Proteins 

In F.xample I I. it was demonstrated that the C. difficile toxin A repeal domain can be 
elflciently expressed and purified in E. coti as either native (expressed in the pLT 23a vector 

15 in clone pPAl 870-2680) or fusion (expressed in the pMALc vector as a fusion with the 

coii MBP in clone pMAl 870-2680) proteins. Fusion proteins comprising a fusion between 
the MBP. portions of the (*. difficile toxin A repeat domain (shown to be expressed as a 
soluble fusion protein) and the C fragment of the C. hotulinum type A toxin were constructed. 
A fusion protein comprising the C fragment of the (\ hotulinum type A toxin and the MBP 

20 was also constructed. 

figure 25 provides a schematic representation of the botulinal fusion proteins along 
with the donor constructs containing the ( * difficile toxin A sequences or ( : hotulinum C 
fragment sequences which were used to generate the botulinal fusion proteins. In Figure 25. 
the solid boxes represent C. difficile toxin A gene sequences, the open boxes represent ( 

25 hotulinum C fragment sequences and the solid black ovals represent the E. coii MBP. When 
the name for a restriction enzyme appears inside parenthesis, this indicates that the restriction 
sue was destroyed during construction. An asterisk appearing with the name for a restriction 
enzyme indicates that this restriction site was recreated at the cloning junction. 

In figure 25. a restriction map of the pMA 1870-2680 and pPA I 100-2680 constructs 

30 (described in Mxample 1 1 ) which contain sequences derived from the ( difficile toxin A 

repeat domain are shown: these constructs were used as the source of r. difficile toxin A gene 
sequences for the construction of plasmids encoding fusions between the ( hotulinum C 
fragment gene and the ('. difficile toxin A gene. The pMA 1870-2680 expression construct 
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expresses high levels of soluble, intact fusion protein (20 mg/liter culture) which can be 
affinity purified on an amylose column (purification described in Example 1 Id). 

The pAherBot construct (Figure 25) was used as the source of C. hotulinum C 
fragment gene sequences for the botulinal fusion proteins. pAlterBot was obtained from J. 
Middle-brook and R. Lemley at the U.S. Department of Defense. pAlterBot contains a 
synthetic ('. hotulinum C fragment inserted in to the pALTER-lflfc vector (Promcga). This 
synthetic C fragment gene encodes the same amino acids as does the naturally occurring C 
fragment gene. The naturally occurring C fragment sequences, like most clostridial genes, are 
extremely A/T rich (Thompson c, uL .supra). This high A/T content creates expression 
difficulties in E. coii and yeast due to altered codon usage frequency and fortuitous 
polyadenylation sites, respectively. In order to improve the expression of C fragment proteins 
in /•-. cali. a synthetic version of the gene was created in which the non-preferred anions were 
replaced with preferred codons. 

The nucleotide sequence of the C. hotulinum C fragment gene sequences contained 
within pAlterBot is listed in SEQ ID NO:22. The first six nucleotides (ATGGCT) encode a 
methionine and alanine residue, respectively. These two amino acids result from the insertion 
of the C hotulinum C fragment sequences into the pALTER® vector and provide the initiator 
methionine residue. The amino acid sequence of the C. hotulinum C fragment encoded by the 
sequences contained within pAlterBot is listed in SEQ ID NO:23. The first two amino acids 
(Met Ala) are encoded by vector-derived sequences. From the third amino acid residue 
"invard ( A,g|. ihe amino acid sequence is identical to that found in the C. hotulinum type A 



toxin none. 



The pMA 1 870-2680. pPA 1 100-2680 and pAlterBot constructs were used as progenitor 
plasmids to make expression constructs in which fragments of the C. diffkil,- toxin A repeat 
domain were expressed as genetic fusions with the ('. hotulinum C fragment gene using the 
pMAI.-e expression vector (New England Biol.abs). The pMAL-c expression vector 
generates fusion proteins which contain the MBP at the amino-terminal end of the protein. A 
construct. pMBol. in which the C. hotulinum C fragment gene was expressed as a fusion with 
only the MBP was constructed (Figure 25). Fusion protein expression was induced from £ 
co/i strains harboring the above plasmids. and induced protein was affinity purified on an 
amylose resin column. 
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i) Construction Of pBlucBot 

In order to facilitate the cloning of the C hatuiinum C fragment gene sequences into a 
number of desired constructs, the botulinal gene sequences were removed from pAllcrBot and 
were inserted into the pBluescript plasmid (Stratagene) to generate pBlueBot (figure 25). 
pBlucBot was constructed as follows. Bacteria containing the pAltcrBot plasmid were grown 
in medium containing tetracycline and plasmid DNA was isolated using the QlAprep-spin 
Plasmid Kit (Qiagen). One microgram of pAllerBot DNA was digested with Xcnl and the 
resulting .V recessed sticky end was made blunt using the Klenow fragment of DNA 
polymerase 1 (here after the Klenow fragment). The pAllerBot DNA was then digested with 
///Will to release the botulinal gene sequences (the Bot insert) as a blunt (tilled ,\a>\ site)- 
///WIII fragment. pBluescript vector DNA was prepared by digesting 200 ng of pBluescript 
DNA with Smal and ///Will. The digestion products from both plasmids were resolved on an 
agarose gel. The appropriate fragments were removed from the gel. mixed and purified 
utilizing the Prep-a-Gcne kit (BioRad). The elutcd DNA was then ligated using 14 DNA 
ligase and used to transform competent DH5a cells (Gibco-BRI). Host cells were made 
competent for transformation using the calcium chloride protocol of Sambrook ei at., supra at 
I.S2-I.X.V Recombinant clones were isolated and continued by restriction digestion using 
standard recombinant molecular biology techniques (Sambrook e/ ul. supra). The resultant 
clone. pBlucBot. contains several useful unique restriction sites Hanking the Bot insert (i.e.. 
the (' hoiulimim C fragment sequences derived from pAlterBot) as shown in figure 25. 

ii) Construction Of C. difficile / C. bolulinum I 
MBP Fusion Proteins 

Constructs encoding fusions between the C. difficile toxin A gene and the C. hotulinum 
C iragment gene and the MBP were made utilizing the same recombinant DNA methodology 
outlined above: these fusion proteins contained varying amounts of the ( '. difficile toxin A 
repeat domain. 

The pMABot clone contains a 2.4 kb insert derived from the C difficile toxin A gene 
fused to the Bot insert {i.e. the C. hniulinum C fragment sequences derived from pAllerBot). 
pMABot (figure 25) was constructed by mixing gel-purified DNA from .Y,,/ 1 //-//Will digested 
pBlucBot (the 1.2 kb Bot fragment). Spe\INot\ digested pI»A 1 100-2680 (the 2.4 kb ('. difficile 
toxin A repeat fragment) and \ha\lllind\\\ digested pMAl.-c vector. Recombinant clones 
were isolated, confirmed by restriction digestion and purified using the QIAprcp-spin Plasmid 
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Kit (Qiagen). This clone expresses (he toxin A repeats and the botulinal C fragment protein 
sequences as an in-frame fusion with the MBP. 

The pMCABot construct contains a 1.0 kb insert derived from the C. difficile toxin A 
gene fused to the Bot insert {i.e. the C. hoiulimim C fragment sequences derived from 
pAlterBoi). pMCABot was constructed by digesting the pMABot clone with EcoR\ to 
remove the 5" end of the C. difficile loxin A repeat (see Figure 25. the pMAL-c vector 
contains a Eco\\\ site 5' to the C. difficile insert in the pMABot clone). The- restriction sites 
were filled and rcligated together after gel purification. The resultant clone (pMCABot. 
f igure 25) generated an in-frame fusion between the MBP and the remaining y portion of the 
( difficile toxin A repeal domain fused to the Bot gene. 

The pMNABot clone contains the I kb SpeUEvoRl Hilled) fragment from the (\ 
difficile toxin A repeat domain (derived from clone pPAl 100-2680) and the 1.2 kb ( '. 
h„,„lim,m C fragment gene as a Sco\ ( filled >////*/! I! fragment (derived from pAltcrBot). 
These two fragments were inserted into the pMAL-c vector digested with X/uiUHmdlU. The 
two insert fragments were generated by digestion of the appropriate plasmid with EcoR\ 
(pPAl 100-2680) or AVol (pAlterBot) followed by treatment with the Klenow fragment. After 
treatment with the Klenow fragment, the plasmids were digested with the second enzyme 
(either Spe\ or HindUl). All three fragments were gel purified, mixed and Prep-a-Genc 
purified prior to ligation. Following ligation and transformation, putative recombinants were 
analyzed by restriction analysis: the EcoRl site was found to be regenerated at the fusion 
junction, as was predicted for a fusion between the filled EcoRl and Ami sites. 

A construct encoding a fusion protein between the botulinal C fragment gene and the 
MBP gene was constructed (,>.. this fusion lacks any C. difficile toxin A gene sequences) and 
termed pMBot. The pMBot construct was made by removal of the ( /. difficile toxin A 
sequences from the pMABot construct and fusing the C fragment gene sequences to the MBP. 
This was accomplished by digestion of pMABot DNA with .SV/,1 (located in the pMALc 
polylinker 5" to the Xha\ site) and Xbal (located .V to the Nol\ site at the toxA-Bot fusion 
junction), fillinu in the Xha\ site using the Klenow fragment, gel purifying the desired 
restriction fragment, and ligating the blunt ends to circularize the plasmid. Following ligation 
and transformation, putative recombinants were analyzed by restriction mapping of the Bot 
insert (i.e. the C. hoiulimim C fragment sequences). 
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10 



20 



b) Expression Of C botulinum C Fragment Fusion Proteins In 

E, coli 

Large scale (1 liter) cultures of the pMAL-c vector, and each recombinant construct 
described above in (a) were grown, induced, and soluble protein fractions were isolated as 
described in Example 18. The soluble protein extracts were chromatographed on amylosc 
affinity columns to isolate recombinant fusion protein. The purified recombinant fusion 
proteins were analyzed by running samples on SDS-PAGE gels followed by Coomassic 
staining and by Western blot analysis as described (Williams at ai (1994) supra]. In brief, 
extracts were prepared and chromatographed in column buffer (10 mM NaP0 4 . 0.5 M NaCl. 
10 mM p-mercaptocihanol. pH 7.2) over an amylose resin (New England Biolabs) column, 
and eluted with column buffer containing 10 mM maltose as described [Williams, ct aL 
(1994). supra]. An SDS-PAGF gel containing the purified protein samples stained with 
Coomassic blue is shown in Figure 26. 

in Figure 26. the following samples were loaded. Lanes 1-6 contain protein purified 
from /:. call containing the pMAL-c. pPA 1870-2680. pMABot. pMNABot. pMCABot and 
pMBoi plasmids. respectively. Lane 7 contains broad range molecular weight protein markers 
(BioRad). 

The protein samples w ere prepared for electrophoresis by mixing 5 H of eluted protein 
with 5 mI of 2X SDS-PAGE sample buffer (0.125 mM Tris-HCI. pi I 6.8. 2 mM EDTA. 6% 
SDS. 20% glycerol. 0.025% bromophenol blue; (i-mercaptoethanol is added to 5% before 
use). The samples were heated to 95°C for 5 min. then cooled and loaded on a 7.5% agarose 
SDS-PACiL gel. Broad range molecular weight protein markers were also loaded to allow 
estimation of the MW of identified fusion proteins. After electrophoresis, protein was 
delected generally by staining the gel with Coomassic blue. 

In all cases the yields were in excess of 20 mg fusion protein per liter culture (see 
Table 36) and. with the exception of the pMCABot protein, a high percentage [i.e.. greater 
than 20-50% of total eluted protein) of the eluted fusion protein was of a MW predicted for 
the full length fusion protein (Figure 26). It was estimated (by visual inspection) that less 
than 10% of the pMCABot fusion protein was expressed as the full length fusion protein. 
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TABLE 36 



15 



20 



10 



Construct 

VI 1311 UV. 1 


Yield (mu/htcr of Culture) 


Percentage Of Total 
Soluble Protein 


pMABoi 


24 


5.0 


pMCABot 


34 


5.0 


pMNABot 


40 


5.5 


pMBot 


22 


5.0 


pMA 1870-2680 


40 


4.8 



These results demonstrate that high level expression of intact C. hniulimm C 
fragment : difficile toxin A fusion proteins in E cnli is feasible using the pMAL-c 
expression system. These results are in contrast to those reported by H. I\ LaPcnotierc. ci al. 

supra. In addition, these results show that it is not necessary to fuse the bolulinal C 
fragment gene to the (/. difficile toxin A gene in order to produce u soluble fusion protein 
using the pMAI.-c system in E cnli. 

In order to determine whether the above-described botulinal fusion proteins were 
recognized by anti-( • hoiulinum toxin A antibodies. Western blots were performed. Samples 
containing a ftinitx -purified proteins from /:. colt containing the pMABot. pMCABot. 
PMNABot. pMBot. pM A 1 870-2680 or pMALc plasmids were analyzed. SDS-PAGH gels 
(7.5% acrylamide) were loaded with protein samples purified from each expression construct. 
Alter electrophoresis, the gels were blotted and protein transfer was confirmed by Ponceau S 
staining (as described in Example 12b). 

following protein transfer, the blots were blocked by incubation for I hr at 20°C in 
blocking buffer |PBST (PBS containing 0.1% Tween 20 and 5% dry milk>|. The blots were 
then incubated in 10 ml of a solution containing the primary antibody: this solution comprised 
a 1/500 dilution of an anli-C hom/inum toxin A IgY PEG prep (described in Example 3) in 
blocking buffer. The blots were incubated for I hr at room temperature in the presence of the 
primary antibody. The blots were washed and developed using a rabbi, ami-chicken alkaline 
phosphatase conjugate (Bochringer Mannheim) as the secondary antibody as follows. The 
rabbit anti-chicken antibody was diluted to I ug/ml in blocking buffer (10 ml final volume 
per blot) and the blots were incubated at room temperature for I hour in the presence of the 
secondary antibody. The blots were then washed successively with PBST. BBS-Tween and 
50 mM Na.CO,. pH 9.5. The blots were then developed in freshly-prepared alkaline 



- 147 - 



WO 98/08540 PCT/US97/15394 

phosphatase substrate buffer (100 Mg/ml nitro blue tctrazoiium. 50 ^ig/mi 5-bromo-chloro- 
indolylphosphate. 5 mM MgCh in 50 mM Na : CO,. pH 9.5). Development was stopped by 
flooding the blots with distilled water and the blots were air dried. 

This Western blot analysis detected anti-C. hoiulimtm toxin reactive proteins in the 
5 pMABot. pMCABoi. pMNABot and pMBot protein samples {corresponding to the predicted 
full length proteins identified above by Coomassie staining in Figure 26), but not in the 
pMAl ! 00.2680 or pMALc protein samples. 

These results demonstrate that the relevant fusion proteins purified on an amylose resin 
as described above in section a) contained immunorcactive (' hotulimtm C fragment protein as 
10 predicted. 

EXAMPLE 23 

Generation Of Neutralizing Antibodies 
By Nasal Administration Of pMBot Protein 

15 

!he ability of ihe recombinant botulinal toxin proteins produced in Example 22 to 
stimulate a systemic immune response against botulinal toxin epitopes was assessed. This 
example involved: a> the evaluation of the induction of serum IgG titers produced by nasal or 
oral administration of botulinal toxin-containing C. difficile toxin A fusion proteins and b) 
20 the in vivo neutralization of C. homlumm type A neurotoxin by anti- recombinant ( 
hoiitlinum C fragment antibodies. 

a) Evaluation Of The Induction Of Serum IgC l iters Produced 
By Nasal Or Oral Administration Of Botulinal Toxin- 
- 5 C ontaining C difficile Toxin A Fusion Proteins 

Six groups containing five 6 week old CF female rats (Charles River) per group were 
immunized nasally or orally with one of the following three combinations using protein 
prepared in Example 22: (1) 250 M g pMBot protein per rat (nasal and oral); 2) 250 pg 
pM Allot protein per rat (nasal and oral); 3) 125 pg pMBot admixed with 125 |ag pMAIX70- 
2680 per rat (nasal and oral). A second set of 5 groups containing 3 (T female rats/group 
were immunized nasally or orally with one of the following combinations (4) 250 pg 
pMNABot protein per rat (nasal and oral) or 5) 250 ug pMAl.-e protein per rat (nasal and 
oral). 



30 
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The fusion proteins were prepared for immunization as follows. The proteins (in 
column buffer containing 10 mM maltose, were diluted in 0. 1 M carbonate buffer. P H 9 5 and 
administered orally or nasally in a 200 ul volume. The rats were lightlv sedated with ether 
prior to administration. The oral dosing was accomplished using a 20 gauge feeding needle. 
The nasal dosing was performed using a P-200 micro-pipettor (Gilson). The rats were 
boosted 14 days after the primary immunization using the techniques described above and 
were bled 7 days later. Rats from each group were lightly etherized and bled from the tail 
The blood was allowed to clot at 37°C for I hr and the serum was collected. 

The serum from individual rats was analyzed using an F.LISA to determine the anti-C 
"<"<</<-<» type A toxin IgG serum titer. The ELISA protocol used is a modification of that 
Penned .n Example 13c. Bricily. 96- well microtiter plates (Falcon. Pro-Bind Assav Plates) 
were coated u,th C hoiuihwm type A toxoid (prepared as described in Example 3a) bv 
Placing 100 u. volumes of ( : ho.ulinum cype A toxoid at 2.5 ug/ml in PBS containing 
<»-005% thimerosal in each well and incubating overnight at 4°C. The next morning, the 
coating suspensions were decanted and all wells were washed three times using PBS. 

In order to block non-specific binding sites. 1 00 ul of blocking solution [0.5% BSA in 
PBS | was then added to each well and the plates were incubated for 1 hr at 37°(.\ The 
Mocking solution was decanted and duplicate samples of 150 M | of diluted rat serum added to 
the hrsi well of a dilution series. The initial testing serum dilution was 1:30 in blocking 
solution containing 0.5% Tween 20 followed by 5-fold dilutions into this solution. This" was 
uccomplwhcd by serially transferring 30 ul aliquots to 120 M | blocking solution containing 

1 ween 20. mixing, and repeating the dilution into a fresh well. After the final dilution 
■'<> Ml -as removed from the well such that all wells contained 120 ul final volume. A total 
ol 3 such dilu.io.Ls were performed (4 wells total). The plates were incubated 1 hr a. 37°C 
f ollowing this incubation, the serially diluted samples were decanted and the wells were 
washed six t.mes using PBS containing 0.5% Tween 20 (PBST). To each well. 100 ul of a 
rabmt anti-Rat IgG alkaline phosphatase (Sigma) diluted (1/1000) in blocking buffer 
containing 0.5% Tween 20 was added and the plate was incubated for I hr at 37°C The 
conjugate solutions were decanted and the plates were washed as described above, substituting 
M) mM Na,CO,. P H 9.5 for the PBST in the final was., The plates were developed bv the & 
addmon of .00 ul of a solution containing I m g/ml para-nitro phenvl phosphate (Sigma) 
dissolved in 50 mM Na ; CO, .0 mM MgCl, pH 9.5 to each well, and incubating the plates a, 
->n, temperature in the dark for 5-45 min. The absorbency of each well was measured at 
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410 nm using a Dynatech MR 700 plate reader. The results arc summarized in Tables 37 and 
38 and represent mean serum reactivities of individual mice. 



TABLE 37 

Determination OI Ami-C homhtuwt Type A lo.\in Scrum lyG Titers I nllmvini; Immunization With ( \ hotitiuimn ( ' l : r;mincnt-C'oiiiainin» 

I "us ion I 'rote ins 



Kmite nl In 


lmuni/.aiion 


Nasal 


Oral 


linmunoiien 


piu:- 

IMMONF 


pMIJot 


pMliot & 
pMAlX70- 
2680 


pM Allot 


pMliot 


pMUot* 
pMAIX70- 
2UHU 


pMAliot 


Dilution 








l..il) 


(1 0X0 


1 .040 


1 .0.10 


0 OhO 


0 |')0 


0 0X0 


0.120 


1 I Mi 


OOI7 


o.5«() 




0022 


0 07(1 


0.02U 


0 027 




0.009 


0.2X0 


0.260 


1)010 


(t 020 


0010 


0014 


l:.*750 


0 007 


0.0X4 


(toon 


0 00') 


0 000 


ooio 


0 007 


« Kiib lested 




•\ 


5 






i 





vumhi-rs represent the a\ cruse values obtained Iron) two lil.l.SA plates. siaiuiartii/eJ uiili/iuu the nrtrinmumc control 



TABLE 38 



Determination Of Anti-C honthmtm Type A Toxin Serum IgCJ Titers 
Following Immunization With ( ', hotulinum C Frniznient-Comaintni! Fusion Proteins 



Route of Immunization 


Nasal 


Oral 


Immunnjien 


PRE-IMMUNE 


pMBot 


pMABot 


pMNABol 


pMNAHot 


Dilution 






1:30 


0.040 


0.557 


0.010 


0.0 1 5 


• 

0.010 


1:150 


0.009 


0.383 


0.001 


0.003 


0.002 


1:750 


0.001 


0.140 


0.000 


0.000 


0.000 


1:3750 


0.000 


0.040 


0.000 


o.ooo 


O.OOO 


« Rats Tested 




1 


1 


3 





The above KLISA results demonstrate that reactivity against the bolulinal fusion 
proteins was strongest when the route of administration was nasal; only weak responses were 
stimulated when the bolulinal fusion proteins were givim orally. Nasally delivered pMbot and 
pMRot admixed with pMA I 870-2680 invoked the greatest serum lg(j response. These results 
show that only the pMBot protein is necessary to induce this response, since the addition of 
(he pMA 1 870-2680 protein did not enhance antibody response ( Fable 37). Placement of the 
C. difficile toxin A fragment between the MBP and the C. hofulini/m C fragment protein 
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dramatically reduced anti-bot IgG titer (see results using pMABot. pMCABot and pMNABot 
proteins). 

This study demonstrates that the pMBot protein induces a strong serum IgG response 
directed against C. botulinum type A toxin when nasally administered. 

b) lit Vivo Neutralization Of C botulinum Type A Neurotoxin 
By Anti- Recombinant C. botulinum C Fragment Antibodies 

The ability of the anti-C. botulinum type A toxin antibodies generated bv nasal 
administration of recombinant botulinal fusion proteins in ra.s (Example 22) to neutralize C 
W/n,«„ ,ypc A toxin was tested in a mouse neutralization model. The mouse model is the 
an accepted method lor detection of botulinal toxins in body fluids and for the evaluation of 
ani,-hotulim,l antibodies [EJ. Schantz and D.A. Kautter. ... Assoc. Off. Anal. Chem. 61:96 
<W0) and Investigational New Drug (BB-IND-3703) application by the Surueon General of 
•he nepar,men, of the Army to the Federal Food and Drug Administration |. The anti-C. 
botulinum type A toxin antibodies were prepared as follows. 

Rats from the group given pMBot protein by nasal administration were boosted a 
second time with 250 m pMBot protein per rat and serum was collected 7 davs later. Serum 
iVom one rat from this group and from a preimmune rat was tested for anti-C botulinum type 
A toxin neutralizing activity in the mouse neutralization model described below. 

The |.D.„ of a solution of purified C botulinum type A toxin complex, obtained from 
Dr. Hric Johnson (University of Wisconsin Madison), was determined using the intraperitoneal 
<H>> method of Schantz and Kautter [J. Assoc. Off. Anal. Chem. 61:96 (1978)) usme 18-22 
gram female ICR mice and was found to be 3500 I.D./ml. The determination of the LD,„ was 
performed as follows. A Type A toxin standard was prepared by dissolving purilled tvpe A 
u«.n complex in 25 mM sodium phosphate buffer. P H 6.8 to yield a stock toxin solution of 

x 10' LD 4 „/mg. The 01), s of the solution was determined and the concentration was 
adjusted to 10-20 M g/ml. The toxin solution was then diluted 1M0O in gel-phosphate (30 mM 
Phosphate. P H 6.4: 0.2% gelatin). Further dilutions of the toxin solution were made as shown 
below in Table 39. Two mice were injected IP with 0.5 ml of each dilution shown and the 
mice were observed for symptoms of botulism for a period of 72 hours. 
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TABLE 39 



Dilution 


Number Dead At 72 hr 


1:320 


2/2 


1 :640 


2/2 


1:1280 


2/2 


1:2560 


0/2 (sick after 72 hr) 


1:5120 


0/2 (no symptoms) 



From the results shown in Table 39. the toxin titer was assumed to he between 2560 
UVml and 5120 LD v /ml (or about .3840 I.D ?n /ml>. This value was rounded to 3500 
LD M /ml for the sake of calculation. 

The amount of neutralizing antibodies present in the serum of rats immunized nasally 
with pMBoi protein was then determined. Serum from two rats boosted with pMBot protein 
as described above and preimmune serum from one rat was tested as follows. The toxin 
standard was diluted 1:100 in gel-phosphate to a final concentration of 350 LD M /ml. One 
milliliter of the diluted toxin standard was mixed with 25 ^tl of serum from each of the three 
rats and 0.2 ml of gel-phosphate. The mixtures were incubated at room temperature for 30 
min with occasional mixing. Ilach of two mice were injected with IP with 0.5 ml of the 
mixtures. The mice were observed for signs of botulism for 72 hr, Mice receiving serum 
from rats immunized with pMBot protein neutralized this challenge dose. Mice receiving 
preimmune rat serum died in less than 24 hr. 

I he amount of neutralizing anti-toxin antibodies present in the serum of rats 
immunized with pMBot protein was then quantilated. Serum antibody titrations were 
performed by mixing 0.1 ml of each of the antibody dilutions (see Table 40) with 0.1 ml of a 
1:10 dilution of stock toxin solution (3.5 x 10 4 I.IVml) wilh , () ml of gel-phosphate and 
injecting 0.5 ml IP into 2 mice per dilution. The mice were then observed for signs of 
botulism for 3 days (72 hr). The results are tabulated in Table 39. 

As shown in Table 40 pMBot serum neutralized ( \ hondhmm type A toxin complex 
when used at a -dilution of 1:320 or less. A mean neutralizing value of 168 IU/ml was 
obtained for the pMBot serum (an UJ is defined as 10.000 mouse LD<„). This value translates 
to a circulating serum titer of about 3.7 UJ/mg of serum protein. This neutralizing titer is 
comparable to the commercially available bottled concentrated (Connaught Laboratories. Ltd.) 
horse anti-C. hatulimtm antiserum. A 10 ml vial of Connaught antiserum contains about 200 
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mg/ml of proteimeach ml can neutralize 750 IU of C hotulimtm type A toxin. After 
administration of one vial to a human, the circulating serum titer of the Connaught 
preparation would be approximately 25 lU/ml assuming an average serum volume of 3 liters). 
Thus, the circulating anti-C. hotulinum titer seen in rats nasally immunized with pMBot 
protein (168 lU/ml) is 6.7 time higher than the necessary circulation liter of anti-C. hotulinum 
antibody needed to be protective in humans. 

TABLE 40 



Dilution 


pMBot* 


Rat 1 


Rai 2 


1:20 


2/2 


2/2 


1:40 


2/2 


2 '2 


1:80 


2/2 


2 


1:160 


2 '2 


*> n 


1:3.20 


2/2* 


2/2" 


1 :640 


0/2 


0/2 


1:1280 


0/2 


0/2 


1:2560 


0/2 


0/2 



Numbers represent the number of mice surviving at 72 hours which received serum taken from 
mis immunized with the pMBot protein. 

These mice survived but were sick after 72 hr. 

These results demonstrate that antibodies capable of neutralizing C. hotulinum type A 
toxin are induced when recombinant C hotulinum C fragment fusion protein produced in E. 
col i is used as an immunogen. 

EXAMPLE 24 

Production Of Soluble C. hotulinum C Fragment 
Protein Substantially Free Of Endotoxin Contamination 



F-xample 23 demonstrated that neutralizing antibodies are generated by immunization 
with the pMBot protein expressed in E. coli. These results showed that the pMBot fusion 
protein is a good vaccine candidate. However, immunogens suitable for use as vaccines 
should be pyrogen-free in addition to having the capability of inducing neutralizing 
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antibodies. Expression clones and conditions that facilitate the production of C baiulinum C 
fragment protein for utililization as a vaccine were developed. 

The example involved: (a) determination of pyrogen content of the pMBot protein: 
lb) generation of C hoiulinum C fragment protein free of the MBP: (c) expression of C. 
.1 bomlimim C fragment protein using various expression vectors: and (d) purification of soluble 
C hoiulinum C fragment protein substantially free of significant endotoxin contamination. 

si) Determination Of The Pyrogen Content Of The pMBnt 
Protein 

10 1,1 order lo use a recombinant antigen as a vaccine in humans or other animals, the 

antigen preparation must be shown to be free of pyrogens. The most significant pyrogen 
present in preparations of recombinant proteins produced in gram-negative bacteria, such as £ 
i-oli. is endotoxin |I-.C. Pearson. f'\mf>ens: endotoxins. l.AL testing anil Jc/nroyenmion. 
(1985) Marcel Dekker. New York. pp. 2.1-56). To evaluate the utility of the pMBot protein 
1 as a vaccine candidate, the endotoxin content in MBP fusion proteins was determined. 

I he endotoxin content of recombinant protein samples was assayed utilizing the 
l.imulus assay (l.AL kit: Associates of Cape Cod ) according to the manufacturer's 
instructions. Samples of affinity-purified pMal-c protein and pMA 1 870-2680 were found lo 
contain high levels of endotoxin f>50.000 EU/mg protein: I II (endotoxin unit)). This 
20 suggested that MBP- or toxin A repeat-containing fusions with the botulinal C fragment 
should also contain high levels of endotoxin. Accordingly, removal of endotoxin from 
affinity-purified pMal-c and pMBot protein preparations was attempted as follows. 

Samples of pMal-c and pMBot protein were depyrogenated with polymyxin to 
determine if the endotoxin could be easily removed. The following amount of protein was 
25 treated: 29 ml at 4.8 OD : Jm\ for pMal-c and 19 mis at 1.44 OD M /m\ for pMBot. The 
protein samples were dialyzed extensively against PBS and mixed in a 50 ml tube (Falcon) 
with 0.5 ml PBS-equilibratcd polymyxin B (Afll-Prep Polymyxin. BioRad). The samples 
were allowed to mix by rotating the tubes overnight at 4°C. The polymyxin was pelleted by 
centrifugation for 30 min in a bench top centrifuge at maximum speed (approximately 2000 x 
.-.') and the supernatant was removed. The recovered protein (in the supernatant) was 
quantified by 0D, S1I . and the endotoxin activity was assayed by LAI.. In both cases only 
approximately 1/3 of the input protein was recovered and the polymyxin-treated protein 
retained significant endotoxin contamination (approximately 7000 LTJ/mg of pMBot). 
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The depyrogenation experiment was repeated using an independently purified pMal-c 
protein preparation and similar results were obtained. From these studies it was concluded 
that significant levels of endotoxin copurif.es with these MBP fusion proteins using the 
amylose resin. Furthermore, this endotoxin cannot be easily removed by polymyxin 
treatment. 

These results suggest that the presence of the MBP sequences on the fusion protein 
complicated Hie removal of endotoxin from preparations of the pMBol protein. 

h) Generation Of C botulinum C Fragment Protein Free Of 
The MBP 

It was demonstrated that the pMBot fusion protein could not be easily purified from 
contaminating endotoxin in section a) above. The ability to produce a pyrogen-free (e. K .. 
endotoxin-free) preparation of soluble botulinal C fragment protein free of the MBP tag was 
"ex. investigated. The pMBoi expression construct was designed to facilitate purification of 
ihe botulinal C fragment from the MBP tag by cleavage of .he fusion protein by utilizing an 
engineered Factor Xa cleavage site present between the MBP and the botulinal C fragment. 
Hie factor Xa cleavage was performed as follows. 

f actor Xa (New England Biolabs) was added to the pMBot protein (using a 0.1-1.0% 
Factor Xa/pMBot protein ratio) in a variety of buffer conditions [e.g.. PBS-NaCl (PBS 
containing 0.5 M Nad). PBS-NaCl containing 0.2% Tween 20. PBS. PBS contaming 0.2% 
Twecn 20. PBS-C (PBS containing 2 mM CaCK). PBS-C containing either 0.1 or 0.5 % 
Tween 20. PBS-C containing either 0.1 or 0.5% NP-40. PBS-C containing either 0.1 or 0.5% 
Triton X-100. PBS-C containing 0.1% sodium deoxycholate. PBS-C containing 0.1% SDS]. 
Ihe factor Xa digestions were incubated for 12-72 hrs at room temperature. 

The extent of cleavage was assessed by Western blot or Coomassie blue staining of 
proteins following electrophoresis on denaturing SDS-PAGE gels, as described in Example 
22. Cleavage reactions (and control samples of uncleaved pMBot protein) were centriluged 
for 2 min in a microfuge to remove insoluble protein prior to loading the samples on the gel. 
The Factor Xa treated samples were compared with uncleaved. unccntrifuged pMBot samples 
on the same gel. The results of this analysis is summarized below. 

I ) Most (about 90%) pMBol protein could be removed by centrifugation. even 
when uncleaved control samples were utilized. This indicated that the pMBot fusion protein 
was not fully soluble (i.e.. it exists as a suspension rather than as a solution). [This result was 



- 155 - 



WO 98/08540 



PCT7US97/15394 



consistent with the observation that most affinity-purified pMBot protein precipitates after 
long term storage (>2 weeks) at 4°C. Additionally, the majority [i.e.. 75%) of induced 
pMBot protein remains in the pellet after sonication and clarification of the induced /•'. coll 
Rcsuspcnsion of these insoluble pellets in PBS followed by sonication results in partial 
solubilization of the insoluble pMBot protein in the pellets.) 

2) The portion of pMBot protein that is fully in solution (about 10% of pMBot 
protein) is completely cleaved by Factor Xa. but the cleaved (released) botulinal C fragment is 
relatively insoluble such that only the cleaved MBP remains fully in solution. 

3) None of the above reaction conditions enhanced solubility without also 
reducing effective cleavage. Conditions that effectively solubilized the cleaved botulinal C 
fragment were not identified. 

4) The use of 0.1% SDS in the buffer used for factor Xa cleavage enhanced the 
solubility of the pMBot protein (all of pMBot protein was soluble). However, the presence of 
the SOS prevented any cleavage of the fusion protein with factor Xa. 

5) Analysis of pelleted protein from the cleavage reactions indicated that both lull 
length pMBol {i.e.. unclcavcd) and cleaved botulinal C fragment protein precipitated during 
incubation. 

These results demonstrate that purification of soluble botulinal C fragment protein after 
cleavage of the pMBot fusion protein is complicated by the insolubility of both the pMBot 
protein and the cleaved botulinal C fragment protein. 

c) Expression Of C botulinum C Fragment Using Various 
Expression Vectors 

In order to determine if the solubility of the botulinal C fragment was enhanced by 
expressing the C fragment protein as a native protein, an N-terminal His-tagged protein or as 
a fusion with glutathione-S-transferase (GST), alternative expression plasmids were 
constructed. These expression constructs were generated utilizing the methodologies described 
in Example 22. Figure 27 provides a schematic representation of the vectors described below. 

In Figure 27. the following abbreviations are used. pP refers to the pET23 vector. 
pHIS refers to the pETHisa vector. pBlue refers to the pBluescript vector. pM* refers to the 
pMAL-e vector and pG refers to the pGEX3T vector (described in Example 1 1). The solid 
black lines represent ('. hotulinum C fragment gene sequences: the solid black ovals represent 
the MBP; the hatched ovals represent GST; "HHHHH" represents the poly-histidine tag. In 
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Figure 27. when the name for a restriction enzyme appears inside parenthesis, this indicates 
that the restriction site was destroyed during construction. An asterisk appearing with the 
name lor a restriction enzyme indicates that this restriction site was recreated at a cloning 



junction. 



i) Construction Of pPBot 

In order to express the C. bomlimim C fragment as a native t/.c. non-fused) protein, 
the pPBot plasmid (shown schematically in Figure 27) was constructed as follows. The C 
fragment sequences present in pAlterBot (Example 22) were removed by digestion of 
pAllerBot with Nco\ and HinJ\\\. The Ka>\lllmd\\\ C fragment insert was ligatcd to 
pETIIisa vector (described in Example 1 8b) which was digested with Ao>l and ///Will. This 
ligation creates an expression construct in which the Arol-encodcd methionine of the botulinal 
C fragment is the initiator codon and directs expression of the native botulinal C fragment. 
The ligation products were used to transform competent BL2l(DE3)pl.ysS cells (Novagen). 
Recombinant clones were ideniiiled by restriction mapping. 

ii) Construction Of pHisBot 

In order to express the C. homlintim C fragment containing a poly-liistidine lag at the 
ammo-terminus of the recombinant protein, the pHisBot plasmid (shown schematically in 
Figure 27) was constructed as follows. The hco\IHhu(\\] botulinal C fragment insert from 
pAlterbot was ligatcd into the pETIIisa vector which was digested with Xhd und IlindlU. 
The .Vol (on the C fragment insert) and Mid (on the piTHisa vector) sites were filled in 
using the Klenow fragment prior to ligation: these sites were then blunt end ligated (the At/el 
site was regenerated at the clone junction as predicted). The ligation products were used to 
transform competent BL21(DE3)pLysS cells and recombinant clones were identified by 
restriction mapping. 

I'he resulting pi lisBot clone expresses the botulinal C fragment protein with a 
histidine-tagged N-terminal extension having the following sequence: MetGlyl lisl lis 
HisHisllisllisHisHisHisllisScrSerClyHislleGluGlvAruHis MetAla. (SEQ II) NO:241: the amino 
acids encoded by the botulinal C fragment gene arc underlined and the vector encoded amino 
acids are presented in plain type. The nucleotide sequence present in the pETIIisa vector 
which encodes the pHisBot fusion protein is listed in SEQ ID NO:25. The amino acid 
sequence of the pHisBot protein is listed in SEQ ID NO:26. 
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iii) Construction Of pGBot 

The botulinal C fragment protein was expressed as a fusion with the glutathione-S- 
iransferase protein by constructing the pGBot plasmid (shown schematically in Fiuure 27). 
This expression construe, was created by cloning the AV„IAW1 C fragment insert present in 
pBlueHot (Example 22) into the pGHX3T vector which was digested with Smal and Xhol 
The AV*1 site (present on the botulinal fragment, was made blunt prior to ligation using the 
Menou fragment. The ligation products were used to transform competent BL2 I cells. 

Each of the above expression constructs were tested by restriction digestion to confirm 
the integrity of the constructs. 

Large scale ( I liter) cultures of pPBot [BL2l(DO)pLysS hostj. pHisBo. 
|BL2.(OE3, P LysS hostj and pGBot (BL21 host, were grown in 2X YT medium and induced 
(ustng 1PTG to 0.8-,. 0 mM) for 3 hrs as described in Example 22. Total, soluble and 
insoluble protein preparations were prepared from I ml aliquot, of each larue scale culture 
I Williams „ „/. „90 4) . stipra] und flna|yzcd by SDS .p AGH N() ^.^ ^ ^ 

delectable ,„ the P PBo< or pHisBo, samples by Coomassie staining, while a prominent 
insoluble band of the anticipated MW was detected in the pGBot sample. Soluble Ivsates of 
the pCiBot large scale (resuspended m PBS) or pIlisBo, large scale |resuspended i„ Novagen 
IX binding buffer (5 mM imidazole. 0.5 M NaCI. 20 mM Tris-IKI. pi I 7 .9)| cultures were 
prepared and used to affinity purify soluble affinity-tagged protein as follows. 

The pGBo. lysa.e was affinity purified on a glutathione-agarose resin (Pharmacia, 
exactly as described in Smith and Corcoran (Current Protocols in Molecular Biologv 
Supplement 28 (1994). pp. 16.7. 1 - 1(,7.7|. The pHisBo. protein was purified on the Ilis-Bincl 
resm (Novagen) utilizing the His-bind buffer kit (Novagen, exactly as described by 
manufacturer. 

Samples from the purification of both the pGBot and pHisBot proteins (including 
uninduced. induced, total, soluble, and affinity-purified e.uted protein, were resolved on SDS- 
PAGE gels. Following electrophoresis, proteins were analyzed by Coomassie staininu or bv 
Western blot detection utilizing a chicken anti-C. homlinnn, Type A toxoid antibody (as 
described in Example 22). 

These studies showed that the pGBot protein was almost entirely insoluble under the 
utilized conditions, while the pHisBo. protein was soluble. Affinity purification of the 
PHisBot protein on this first attempt was inefficient, both in terms of yield (most of the 
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immunoreactivc botulinal protein did not bind to the His-bind resin) and purity (the botulinal 
protein was estimated to comprise approximately 20% of the total eluted protein). 

d) Purification Of Soluble C. botulinum C Fragment Protein 
Substantially Free Of Endotoxin Contamination 

The above studies showed that the pHisBot protein was expressed in £ cnli as a 
soluble protein. However, the affinity purification of .his protein on the His-bind resin was 
very inefficient. In order lo improve the affinity purification of the soluble pHisBot protein 
(in terms of both yield and purity), an alternative poly-histidine binding affinity resin (Ni- 
NTA resin: Qiagen) was utilized. The Ni-NTA resin was reported to have a superior binding 
affinity <K d * I x 10" at pH 8.0: Qiagen user manual) relative to the His-bind resin. 

A soluble l> sate (in Novagen IX binding buffer) from an induced I liter 2X YT 
culture was prepared as described above. Briefly, the culture of pHisBot [BI21(Dn3)pLysS 
hos,| was grown at 37°C lo an OD„„ of 0.7 in I liter of 2X YT medium containing 100 ' 
Hg/ml ampicillin. 34 ug/ml chloramphenicol and 0.2% glucose. Protein expression was 
induced by the addition of IPTCi to I mM. Three hours after the addition of the IPTG. the 
cells were cooled for 15 min in a ice water bath and then centrifuged 10 min at 5000 rpm in 
a JAK) rotor (Beckman) at 4°C. The pellets were rcsuspended in a total volume of 40 mis 
Novagen IX binding buffer (5 mM imidazole. 0.5 M NaCl. 20 mM T.is-HCI. pi I 7.9). 
iransfcrrcd to two 35 ml Oakridge tubes and frozen at -70°C for at least I hr. The lubes 
were thavxed and the cells were lysed by sonication (4 X 20 second bursts using a Branson 
Soniller 450 with a power setting of 6-7) on ice. The suspension was clarified by 
cenirifugation for 20 min at 9.000 rpm (10.000 x K ) in a JA- 1 7 rotor (Beckman).' 

The soluble lysatc was brought to 0.1% NP40 and then was batch absorbed to 7 ml of 
a 1:1 slurry of Ni-NTA resin.binding buffer by stirring for I hr at 4°C. The slurry was 
poured into a column having an internal diameter of I or 2.5 cm (BioRad). The column was 
then washed sequentially with 15 mis of Novagen IX binding buffer containing 0.1% Nl'40. 
15 ml of Novagen IX binding buffer. 15 ml wash buffer (60 mM imidazole. 0.5 M NaCl. 20 
mM Tris-HCI. pH 7.9) an d 15 ml NaHP0 4 wash buffer (50 mM NaHPO,. pi I 7.0. 0.3 M 
NaCl. 10 % glycerol). The bound protein was eluted by probation of the resin using elution 
buffer ,50 mM NaHPO,. P H 4.0. 0.3 M NaCl. 10 % glycerol). The eluted protein was stored 



at 4°C. 
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15 



Samples of total, soluble and eluted protein were resolved by SDS-PAGE. Protein 
samples were prepared for electrophoresis as described in Example 22b. Duplicate gels were 
stained with Coomassie blue to visualize the resolved proteins and C hondhmm type A toxin- 
reactive protein was detected by Western blot analysis as described in Example 22b. A 
representative Coomassie stained gel is shown in Figure 28. In Figure 28. the following 
samples were loaded on the 12.5% acrylamide gel. Lanes 1-4 contain respectively total 
protein, soluble protein, soluble protein present in the tlow-through of the Ni-NTA column 
and affinity-purified pHisBol protein lie. protein released from the Ni-NTA resin by 
protonation). Lane 5 contains high molecular weight protein markers (BioRad). 

The purification of pHisBol protein resulted in a yield of 7 mg of affinity purified 
protein from a 1 liter starting culture of BL2l(DE3)pLysS cells harboring the pHisBot 
plasmid. The yield of purified pHisBot protein represented approximately 0.4% of the total 
soluble protein in the induced culture. Analysis of the purified pHisBot protein bv SDS- 
PAGE revealed that at least 90-05% 0 f the protein was present as a single band (Figure 28) of 
the predicted MW (50 kD). This 50 kL) protein band was immunoreaetive with anii-C. 
hotulinum type A toxin antibodies. The extinction coefficient of the protein preparation was 
determined to be 1.4 {using the Pierce BCA assay) or 1.45 (using the Lowry assay) OI)^ per 
I mg/ml solution. 

Samples of pi I neutralized eluted pHisBot protein were resolved on a KB 803 HPLC 
column (Shodex). Although Mis-tagged proteins are retained by this sizing column (perhaps 
due to the inherent metal binding ability of the proteins), the relative mobility of the pHisBot 
protein was consistent with that expected for a non-aggregated protein in solution. Most of 
the induced pHisBot protein was determined to be soluble under the growth and solubilization 
conditions utilized above (/.<.'.. greater than 90% of the pHisBot protein was found to be 
2r> soluble as judged by comparison of the levels of pHisBol protein seen in total and soluble 
protein samples prepared from BL2l(DE3)pLysS cells containing the pHisBot plasmid). 
SDS-PAGE analysis of samples obtained after centrifugation. extended storage at -20°C. and 
at least 2 cycles of freezing and thawing detected no protein loss (due to precipitation), 
indicating that the pHisBot protein is soluble in the elution buffer [i.e.. 50 mM NallPO,. pH 
30 4.0. 0.3 M NaCl. 10 % glycerol). 

Determination of endotoxin contamination in the affinity purified pHisBot preparation 
(alter pH neutralization) using the LAL assay (Associates of Cape Cod) delected no 
significant endotoxin contamination. The assay was performed using the cndpoinl 
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chromogenic method (without diazo-coupling) according to the manufacturers instructions. 
This method can detect concentrations of endotoxin greater than or equal to 0.03 EU/ml (EU 
refers to endotoxin units). The LAL assay was run using 0.5 ml of a solution comprising 0.5 
nig pHisBot protein in 50 mM NaWPO,. pH 7.0. 0.3 M NaCI. 10 % glycerol; 30-60 EU were 
detected in the 0.5 ml sample. Therefore, the affinity purified pHisBot preparation contains 
60-120 EU/mg of protein. FDA Guidelines for the administration of parenteral drugs require 
that a composition to be administered to a human contain less than 5 EU/kg body weight (The 
average human body weight" is 70 kg: therefore up to 349 EU units can be delivered in a 
parental dose.). Because very small amount of protein are administered in a vaccine 
preparation (generally in the range of 10-500 ug of protein), administration of affinity 
purified pHisBot containing 60-120 EU/mg protein would result in delivery of only a small 
percentage of the permissible endotoxin load. For example, administration of 10-500 ug of 
purified pllisBot lo a 70 kg human, where the protein preparation contains 60 EU/mg protein, 
results in the introduction of only ().(, to 30 EU 0.2 to 8.6% of the maximum allowable 
endotoxin burden per parenteral dose (less than 5 F.U/kg body weight)]. 

The above results demonstrate that endotoxin (IT'S) does not copurify with the 
pHisBot protein using the above purification scheme. Preparations of recombinant^ produced 
pl lisBot protein containing lower levels of endotoxin (less than or equal to 2 El J/ mg 
recombinant protein) may be produced hy washing the Ni-NTA column with wash buffer until 
the ()D. S „ returns to baseline levels [i.e.. until no more UV-absorbing material comes off of 
the column). 

I'he above results illustrate a method for the production and purification of soluble, 
hotulinal C fragment protein substantially free of endotoxin. 



EXAMPLE 25 

Optimization Of The Expression And Purification Of pHisBot Protein 

The results shown in Example 24d demonstrated that the pllisBot protein is an 
excellent candidate for use as a vaccine as it could be produced as a soluble protein in L vali 
and could be purified free of pyrogen activity. In order to optimize the expression and 
purification of the pl lisBot protein, a variety of growth and purification conditions were 
tested. 
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a) Growth Parameters 
i) Host Strains 

The influence of the host strain utilized upon the production of soluble pi lisBol 
protein was investigated. A large scale purification of pHisBot was performed [as described 
in Example 24d above] using the BL2I(DE3) host (Novagen) rather than the 
BL2l(I)E3)pLysS host. The deletion of the pLysS plasmid in the BL21(DE3) host yielded 
higher levels of expression due to de-repression of the plasmid's T7-lac promoter. However, 
the yield of affinity-purified soluble recombinant protein was very low (approximately 600 
ug/ liter culture) when purified under conditions identical to those described in Example 24d 
above. This result was due to the fact that expression in the BL2UDE3) host yielded very 
high level expression of the pIlisBot protein as insoluble inclusion bodies as shown by SDS- 
i'AGE analysis of protein prepared from induced BL2!(DE3) cultures ( Figure 29. lanes 1-7. 
described below). These results demonstrate that the pHisBot protein is not inherently toxic 
to E coli cells and can be expressed to high levels using the appropriate promoter/host 
combination. 

f igure 29 shows a Coomassie blue stained SDS-PAGE gel (12.5% aerylamide) onto 
which extracts prepared from BL2l(DE3) cells containing the pHisBot plasmid were loaded. 
Each lane was loaded with 2.5 ul protein sample mixed with 2.5 ul of2X SDS sample buffer. 
The samples were handled as described in Example 22b. The following samples were applied 
to the gel. Lanes 1-7 contain protein isolated from the BL2I(DE3) host. I.ancs 8-14 contain 
proteins isolated from the BL2I (DE3)pl,vsS host. Total protein was loaded in lanes 1. 2. 4. 
6. 8. 10 and 12. Soluble protein was loaded in Lanes 3. 5. 7. 9. 1 1 and 1.3. Lane 1 contains 
protein from uninduced host cells. Lanes 2-13 contain protein from host cells induced for 3 
hours. IPTG was added to a final concentration of 0.1 mM (Lanes 6-7). 0.3 mM (Lanes 4-5) 
or !.() mM (Lanes 2. 3. 8-13). The cultures were grown in LB broth (Lanes 8-9). 2X YT 
broth (Lanes 10-1 1 ) or terrific broth (Lanes 1-7. 12-13). The pHisBot protein seen in Lanes 
3. 5 and 7 is insoluble protein which spilled over from Lanes 2. 4 and 6. respectively. High 
molecular weight protein markers (BioRad) were loaded in Lane 14. 

A variety of expression conditions were tested to determine if the BI.21(DF3) host 
could be utilized to express soluble pHisBot protein at suitably high levels (i.e.. about 10 
mg/ml). The conditions altered were temperature (growth at 37 or 30°C). culture medium 
<2X YT. LB or Terrific broth) and inducer levels (0.1. 0.3 or 1.0 mM IPTG). All 
combinations of these variables were tested and the induction levels and solubility was then 
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assessed by SDS-PAGE analysis of total and soluble extracts [prepared from 1 ml samples as 
described in Williams ei al., (1994). supra). 

All cultures were grown in 15 ml tubes (Falcon #2057). All culture medium was 
prewnrmed overnight at the appropriate temperature and were supplemented with 100 M g/ml 
ampicillin and 0.2% glucose. Terrific broth contains 12 g/1 bacto-tryptone. 24 g/l bacto-veas, 
extract and 100 ml/1 of a solution comprising 0.17 M KH,PO, 0.72 M K,IIP0 4 . Cultures 
were grown in a incubator on a rotating wheel (to ensure aeration) to an OI) ((H1 of 
approximately 0.4. and induced by the addition of IPTG. In all cases. hi«h level expression 
<•< -n.so.uble pHisBot protein was observed, regardless of temperature, medium or inducer 
concentration. 

The effect of varying the concentration of IPTG upon 2X YT cultures urown a. ?3°C 
was then investigated. IPTG was added to a final concentration of either 1 mM 0 , mM 
<>.05 mM or (».0, mM. A, th.s temperature, similar levels of pHis Hot protein was induced ,n 
•he presence of either I or 0. 1 mM IPTG: these levels of expression was lower than that 
observed at higher temperatures. Induced protein levels were reduced a. 0.05 mM IPTG and 
unscn, at 0.01 mM IPTG (relative ,o 1.0 and 0.1 mM IPTG inductions a. 23V). However 
no conditions were observed in which the induced pHisBot protein was soluble in this host 
Ihus. although expression levels are superior in the BL2KDF.3) hos, ,as compared to the 
BL2l(l3E3) P LysS host), conditions that facilitate the production of soluble protein in this host 
could not be identified. 

These results demonstrate that production of soluble pHisBot protein was achieved 
using the BL2l(DE3)p|.y S S host in conjunction with the T7-lac promoter. 

ii) Effect Of Varying Temperature, Medium And 
IPTG Concentration And Length Of Induction 

The effect growing the host cells in various mediums upon the expression of 
recombinant botulinal protein from the pHisBot expression construct [in the BL^HDmpI vsS 
hostl was investigated. BL21(Dn 3 )pLysS cells containing the pHisBot plasmid were ,rown 
m ether LB. 2X YT or Terrific broth a, 37°C. The cells were induced usin* , mM IPTG for 
u - hr mduct.on period. Expression of pHisBot protein was found to be the hiuhest when the 
cells were grown in 2X YT broth (see Figure 29. lanes 8-13). 

The cells were then grown at 30°C in 2X YT broth and the concentration of IPTG was 
vancd front 1 .0. 0.3 or 0. 1 mM and the length of induction was either 3 or S hou 



was 
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Fxpression of pHisBot protein was similar at a!! 3 inducer concentrations utilized and the 
levels of induced protein were higher after a 5 hr induction as compared to a 3 hr induction. 

Using the conditions found to be optimal for the expression of pHisBot protein, a large 
scale culture was grown in order to provide sufficient material for a large scale purification of 
the pHisBot protein. Three 1 liter cultures were grown in 2X YT medium containing 100 
Mg/ml ampiciliin. 34 ^ig/ml chloramphenicol and 0.2% glucose. The cultures were grown at 
30°C and were induced with 1.0 mM iPTG for a 5 hr period. The cultures were harvested 
and a soluble lysatc were prepared as described in Example 18. A large scale purification 
was performed as described in Example 24d with the exception thai except the soluble lysatc 
was batch absorbed for 3 hours rather than for I hour. The final yield was 13 mg pHisBot 
protein/liter culture. The pHisBot protein represented 0.75% of the total soluble protein. 

The above results demonstrate growth conditions under which soluble pHisBot protein 
is produced {i.e.. use of the BL2I(L)E3)pLysS host. 2X YT medium. 30°C. 1.0 mM IPTG for 
5 hours). 

b) Optimization Of Purification Parameters 

for optimization of purification conditions, large scale cultures (3 X 1 liter) were 
grown at 30°C and induced with 1 mM IPTG for 5 hours as described above. The cultures 
were pooled, distributed to centrifuge bottles, cooled and pelleted as described in Example 
24d. The cell pellets were frozen at -70°C until used. Each cell pellet represented 1/3 of a 
liter starting culture and individual bottles were utilized for each optimization experiment 
described below. This standardized the input bacteria used for each experiment, such that the 
yields of affinity purified pHisBot protein could be compared between different optimization 
experiments. 



i) Binding Specificity (pH Protonation) 

A lysate of pHisBot culture was prepared in PBS (pll 8.0) and applied to a 3 ml Ni- 
KTA column equilibrated in PBS (pll 8.0) using a How rate of 0.2 ml/min (3-4 column 
volumes/hr) using an Econo chromatography system (BioRad). The column was washed with 
PBS (pl l 8.0) until the absorbance (OI) :x j of the elulc was at baseline levels. The flow rate 
was then increased to 2 ml/min and the column was equilibrated in PBS (pll 7.0). A pll 
gradient (pH 7.0 to 4.0 in PBS) was applied in order lo elulc the bound pHisBot protein from 
the column. Fractions were collected and aliquots were resolved on SDS-PAGE gels. The 



- 164 - 



WO 98/08540 PCT/US97/15394 

PAGE gels were subjected to Western blotting and the pHisBot protein was detected using a 
chicken ami-C. hotulinum Type A toxoid antibody as described in Example 22. 

From the Western blot analysis it was determined that the pHisBot protein begins to 
elute from the Ni-NTA column at pH 6.0. This is consist™, with the predicted elution of a 
His-tagged protein monomer at pH 5.9. 

These results demonstrate that the pH at which the pHisBot protein is protonated 
< released l from Ni-NTA resin in PBS buffer is pH 6.0. 

ii) Binding Specificity (Imidazole Competition) 

In order to define purification conditions under which the native E coli proteins could 
be removed from the Ni-NTA column while leaving the pHisBot protein bound to the 
column, the following experiment was performed. A lysate of pHisBot culture was prepared 
m 50 mM Nal.PO, 0.5 M NaCI. 8 mM imidazole (pll 7.0). This lysate was applied to a 3 
ml Ni-NTA column equilibrated in 50 mM NaHPO,. 0.5 M NaCI (pH 7.0) using an Econo 
chromatography system (BioRad). A How rate of 0.2 ml/min (3-4 column volumes/hr) was 
utilized. The column was washed with 50 mM NaHP0 4 . 0.5 M NaCI (pH 7.0) until the 
absorbancc of .he elute returned to baseline. The flow rate was then increased lo 2 ml/min. 

I lie column was elutcd using an imidazole step gradient |in 50 mM NaHPO,. 0.5 M 
NaCI (pll 7.0)|. Hlution steps were 20 mM. 40 mM. 60 mM. 80 mM. 100 mM. 200 mM. 1.0 
M imidazole, followed by a wash using 0.1 mM EDTA (to strip the nickel from the column 
and remove any remaining protein). In each step, the wash was continued until the OD,„ 
returned to baseline, fractions were resolved on SDS-PAGF. gels. Western blotted, and " 
PHisBot protein detected using a chicken anti-C. boiuiimm, Type A toxoid antibody as 
described in Example 22. Duplicate geis were stained with Coomassie blue to detect elated 
protein in each fraction. 

The results of the PAGE analysis showed that most of the non-specitlcallv binding 
bacterial protein was removed by the 20 mM imidiazole wash, with the remaining bacterial 
proteins being removed in the 40 and 60 mM imidazole washes. The pllisBot protein began 
to elute at 100 mM imidazole and was quantitatively eluted in 200 mM imidazole. 

These results precisely defined the window of imidazole wash stringency that 
optimally removes E coli proteins from the column while specifically retaining the pHisBot 
protein in this buffer. These results provided conditions under which the pHisBo, protein can 
be purified free of contaminating host proteins. 
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iii) Purification Buffers And Optimized 
Purification Protocols 

A variety of purification parameters were tested during the development of an 
optimized protocol for batch purification of soluble pHisBol protein. The results of these 
5 analyses arc summarized below. 

Batch purifications were performed (as described in Kxample 24d) using several 
buffers to determine if alternative buffers could be utilized for binding of the pHisBot protein 
to the Ni-NTA column. It was determined that quantitative binding of pMisBot protein to the 
Ni-NTA resin was achieved in either Tris-HCl (pi I 7,')) or NaHP0 4 (pi I 8.0) buffers. 

10 Binding of the pHisBot protein in NaHPO, buffer was not inhibited using 5 mM. 8 mM or 60 
m.M imidazole. Quantitative elution of bound pHisBot protein was obtained in buffers 
containing 50 mM NaHPO,. 0.3 M NaCI (pH 3.5-4.0), with or without 10% glycerol. 
However, quantitation of soluble affinity purified pHisBot protein before and after a freeze 
thaw (following several weeks storage of the affinity purified elutc at -2()°C) revealed that 

b c )4% of the protein was recovered using the glycerol-containing buffer, but only 68% of the 
protein was recovered when the buffer lacking glycerol was employed. This demonstrates 
tiiat glycerol enhanced the solubility of the pHisBot protein in this low P M buffer when the 
eluted protein was stored at freezing temperatures U'.tf.. -2()°C). Neutralization of pH by 
addition of NaFLPO., buffer did not result in obvious protein precipitation. 

- () Jt vvas determined that quantitative binding of pHisBot protein using ihe batch format 

occurred after 3 hrs (Figure 30). but not after 1 hr of binding at 4°C (the resin was stirred 
during binding). Figure 30 depicts a Coomaisse blue stained SDS-PAGK gel < 7.5% 
acrylamide) containing samples of proteins isolated during the purification of pHisBot protein 
from lysaie prepared from the BL21(DE3)pLysS host. Hach lane was loaded with 5 jil of 

25 protein sample mixed with 5 ja! of 2X sample buffer and processed as described in Kxample 
22b. Lane I contains high molecular weight protein markers (BioRad). Lanes 2 and 3 
contain protein eluted from the Ni-NTA resin. Lane 4 contains soluble protein after a 3 hr 
batch incubation with the Ni-NTA resin. Lanes 5 and 6 contain soluble and total protein, 
respectively. Figure 30 demonstrates that the pHisBot protein is completely soluble [compare 

30 Lanes 5 and 6 which show that a similar amount of the 50 kD pHisBot protein is seen in 
both: if a substantial amount (greater than 20%) of the pHisBot protein were partially 
insoluble in the host cell, more pHisBot protein would be seen in lane 6 (total protein) as 
compared to lane 5 (soluble protein)]. Figure 30 also demonstrates that the pHisBot protein is 
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completely removed from the lysatc after batch absorption with the Ni-NTA resin for 3 hours 
(compare Lanes 4 and 5). 

The reported high affinity interaction of the Ni-NTA resin with His-tagged proteins 
(Kj= i x !0 ' - at pH 8.0) suggested that it should be possible to manipulate the resin-protein 
complexes without 'significant release of the bound protein. Indeed, it was determined that 
after the recombinant protein was bound to the Ni-NTA resin, the resin-pHisBot protein 
complex was highly stable and remained bound following repeated rounds of centrifugation of 
the resin lor 2 min at 1600 x y. When this centrifugation step was performed in a 50 ml tube 
(Falcon), a light resin pellet formed. This allowed the removal of spent soluble lysatc by 
pouring off the supernatant followed by resuspension of the pellet in wash buffer. Further 
washes can be performed by centrifugation. The ability to perform additional washes permits 
the development of protocols for batch absorption of large volumes of lysatc with removal of 
the l> sate being performed simply by centrifugation following binding of the recombinant 
protein to the resin. 

A simplified, integrated purification protocol was developed as follows. A soluble 
lysate was made by rcsuspending the induced cell pellet in binding buffer [50 mM NallPO,. 
0.5 M NaCI. 60 mM imidazole (pM 8.0)|. sonicating 4 x 20 sec and centrifuging for 20 min 
at 10.000 x M . NIM0 was added to 0.1% and Ni-NTA resin (equilibrated in binding buffer) 
was added. Light milliliters of a 1:1 slurry (resin:binding buffer) was used per liter of 
starting culture. The mixture was stirred tor 3 hrs at 4°C. The slurry was poured into a 
column having a I cm internal diameter (BioRad). washed with binding buffer containing 
0.1% Nl>40. then binding buffer until baseline was established (these steps may alternatively 
be performed by centrifugation of the resin, resuspension in binding buffer containing NP40 
followed by centrifugation and resuspension in binding buffer). Imidazole was removed by 
washing the resin with 50 mM NaHPO, 0.3M NaCI ( P H 7.0). Protein bound to the resin was 
elutcd using the same buffer (50 mM NaHPO,, 0.3M NaCI) having a reduced pH (pH 3.5- 
4.0). 

A pilot purification was performed following this protocol and yielded 18 mg/liter 
affinity-purified pMisBot. The pHisBot protein was greater than 90% pure as estimated by 
Coomassie staining of an SDS-PAGE gel. This represents the highest observed yield of 
soluble affinity-purified pHisBot protein and this protocol eliminates the need for separate 
imidazole-containing binding and wash buffers. In addition to providing a simplified and 
efficient protocol for the affinity purification of recombinant pHisBot protein, the above 
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results provide a variety of purification conditions under which pHisBot protein can be 
isolated. 



EXAMPLE 26 

The pHisBot Protein Is An Effective Immunogen 

In Example 23 it was demonstrated that neutralizing antibodies arc generated in mouse 
serum alter nasal immunization with the pMBot protein. However, the pMBot protein was 
found to copurify with significant amounts of endotoxin which could not be easily removed. 
The pHisBot protein, in contrast, could be isolated free of significant endotoxin contamination 
making pi lisBot a superior candidate for vaccine production. To further assess the suitability 
of pHisBot as a vaccine, the immunogenicity of the pHisBot protein was determined and a 
comparison of the relative immunogenicity of pMBot and pHisBot proteins in mice was 
performed as follows. 

Two groups of eight BAl.Bc mice were immunized with either pMBot protein or 
pHisBot protein using Gerbu CiMDP adjuvant «X Biotech). pMBot protein (in PBS 
containing 10 mM maltose) or pHisBot protein (in 50 mMNuHPO,. 0.3 M NaCI. 10% 
glycerol. pH 4.0) was mixed with Gerbu adjuvant and used to immunize mice. Each mouse 
received an IP injection of 100 ul antigen/adjuvant mix (50 M g antigen plus 1 ug adjuvant) on 
day 0. Mice were boosted as described above with the exception that the route of 
administration was 1M on day 14 and 28. The mice were bled on day 77 and anti-C. 
hoiulimm, Type A toxoid titers were determined using serum collected from individual mice 
in each group (as described in Example 23). The results are shown in Table 41. 
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The results shown above in Table 41 demonstrate that both the pMBot and pHisBot 
proteins are immunogenic in mice as 100% of the mice (8/8) in each group serocon verted 
from non-immune to immune status. The results also show that the average titer of anti-C 
homlmwu Type A toxoid lg(i is 2-3 fold higher after immunization with the pllisBot protein 
relative to immunization with the pMBot protein. This suggests that the pllisBot protein may 
be a superior immunogen to the pMBot protein. 



EXAMPLE 27 

Immunization With The Recombinant 
pHisBot Protein Generates Neutralizing Antibodies 

The results shown in Example 26 demonstrated that both the pl lisBot and pMBot 
proteins were capable of inducing high titers of anti-C bnmlinun, type A toxoid-reactive 
antibodies in immunized hosts. The ability of the immune sera from mice immunized with 
either the pHisBot or pMBot proteins to neutralize C hondinum type A toxoid in v/m was 
determined using the mouse neutralization assay described in Example 23b. 
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The two groups of eight BALBc mice immunized with either pMBot protein or 
pHisBot protein in Example 26 were boosted again one week after the bleeding on day 77. 
The boost was performed by mixing pMBot protein (in PBS containing 10 mM maltose) or 
pHisBot protein (in 50 mM NaHP0 4 . 0.3 M NaCl. 10% glycerol, pll 4.0) with Gerbu 
adjuvant as described in Example 26. Each mouse received an IP injection of 100 ul 
antigen/adjuvant mix (50 p.g antigen plus I ug adjuvant). The mice were bled 6 days after 
this boost and the serum from mice within a group was pooled. Serum from preimmune mice 
was also collected (this serum is the same serum described in the footnote to Tabic 41 ). 

The presence of neutralizing antibodies in the pooled or preimmune serum was 
detected by challenging mice with 5 I.D 50 units of type A toxin mixed with 100 ul ol pooled 
serum. The challenge was performed by mixing (per mouse to be injected) 100 M l of scrum 
from each pool with 100 M l of purified type A toxin standard (50 I.D<„ /ml prepared as 
described in Hxamplc 23b) and 500 ul of gel-phosphate. The mixtures were incubated for 30 
min at room temperature with occasional mixing. Each of four mice were injected IP with 
Ihe mixtures (0.7 ml/mouse). The mice were observed for signs of botulism for 72 hours. 
Mice receiving toxin mixed with serum from mice immunized with either the pliisBoi or 
pMBot proteins showed no signs of botulism intoxication. In contrast, mice receiving 
preimmune serum died in less than 24 hours. 

These results demonstrate that antibodies capable of neutralizing ( : hoiulinum type A 
toxin are induced when cither of the recombinant C. hntuiimim C fragment proteins pHisBot 
or pMBot are used as immunogens. 

EXAMPLE 28 

Cloning And Expression Of The C Fragment of ( ' hoiulimtm 
Serotype A Toxin In E. coli Utilizing A Native Gene Fragment 

In Example 22 above, a synthetic gene was used to express the C fragment of ( '. 
hoiulinum serotype A toxin in E. co/i. The synthetic gene replaced non-preferred [i.e.. rare) 
codons present in the C fragment gene with codons which are preferred by coli. The 
synthetic gene was generated because it was been reported that genes which have a high A/T 
content (such as most clostridial genes) creates expression difficulties in E. coli and yeast. 
Furthermore. I.aPenoticrc vi ul. suggested that problems encountered with the stability (non- 
fusion constructs) and solubility (MBP fusion constructs) of the C fragment of C hutulinum 
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•serotype A toxin when expressed in E. co/i was most likely due to the extreme A/T richness 
ol the native C. hotuliman serotype A toxin gene sequences (LaPenotiere. e , at., supra). 

In this example, i, was demonstrated that successful expression of the C fragment of 
< *»/«/«,,«„ type A toxin gene in E. cn,i does not require the elimination of rare codons 
('-.. ihere is no need to use a synthetic gene). This example involved a, the cloning of the 
nauvc L fragment of the C ho.ulim.m serotype A toxin gene and construction of an 
cxpa-ss.cn vector and b) a comparison of the expression and purification vields of C 
'^•Innnn serotype A C fragments derived from native and synthetic expression vectors. 

a) Cloning Of The Native C Fragment Of The C botulinum 

Serotype A Toxin Gene And Construction Of An Expression 
Vector 

The serotype A toxin gene was cloned from C. W/,„„„ gnomic DNA usinu PCR 

amplication. The following primer pa.r was employed: 5 -CGCCATGGCTAG 

A ITATTATCTACATTTACV ( 5" primer. Ncol site underlined: SRQ ID NO ">*)) and 

^ -CK AAGiXTCTTCiACAGArTCAKiTAGO- (.V primer. Hinilll site underlined: SEQ ID 
NO:,0). ( hltwlimm tvpc A stram was obtajncd f . om Amer . cun r ( ^ ( ui(ure 

•A ICC #19397, and grown under anaerobic conditions in Terrillc broth medium Hum 
moleculur-weight C Hon.linunr DNA was isolated as described in Example ... The integrity 
and > ,<d of genomic DNA was assessed by comparison with a serial dilution of uncut lambda 
DNA alter electrophoresis on an agarose gel. 

The gene fragment was cloned by PCR utilizing a proofreading thermostable DNA 
polymerase (native />/„ po.ymera.se). PCR amplification was performed using the above 
primer pa.r in a 50 M I reaction containing lOmM Tris-HCI (pH X.3). 50mM KCI 1 5mM 
■VlgCI, 200„M each dNTP. 0.2 M M each primer, and 50ng C. ho.ulinun, genomic DNA 
Reaeuons were overlaid with I00 M 1 mineral oil. heated to 94 C 4 min. 0.5„l native />/„ 
polymerase (Stratagene, was added, and thirty cycles comprising 94"C for I min. WC , or , 
mm. 72-C for 2 min were carried out fol.owed by 10 min at 72'C. An aliquot (.0„l) of tlJ 
rcacuon m.xturc was resolved on an agarose gel and the amplified native C fraumcnt BC „e 
was gel punlted using the Prep-A-Gene kit (BioRad) and ligated to pCRScrip, vector DNA 
•Stratagene,. Recombinant clones were iso.ated and confirmed by restricts digestion using 
standard recombinant molecular biology techniques |Sambrook at. (.989) supra] In 
addmon. the sequence of approximately 300 bases located a, the 5' end of the C fragment 
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coding region were obtained using standard DNA sequencing methods. The sequence 
obtained was identical to that of the published sequence. 

An expression vector containing the native C. botulinum serotype A C fragment gene 
was created by ligation of the A'cv>I-/Y/wdIIl fragment containing the C fragment gene from the 
pCRSeript clone to NheWIindlU restricted pETHisa vector (Example 18b). The Nco\ and 
A7k'I sites were filled in using the Klenow enzyme prior to ligation: these sites were thus 
blunt-end ligated together. The resulting construct was termed pHisRolA (native). pHisBotA 
(native) expresses the ('. botulinum serotype A C fragment with a his-tagged N terminal 
extension which has the following sequence: 

MetCHyHisHisIIisHislIisHisHis^ ( j>EQ ID 

NO:24). where the underlining represents amino acids encoded by the (\ botulinum C 
fragment gene (this N terminal extension contains the recognition site for EactorXa protease, 
shown in italics, which can be employed to removed the polyhistdine tract from the N- 
terminus of the fusion protein). The pHisRot (native) construct expresses the identical protein 
as the pl lisBot construct (Ex. 24c: herein after the pHisBotA) which contains the synthetic 
gene. 

The predicted DNA sequence encoding the native C. hotulimtm serotype A C fragment 
gene contained within pHisBotA (native) is listed in SEQ 11) N():3I [the start of translation 
(AT(i) is located at nucleotides 108-1 10 and the stop of translation (TAA) is iocatcd at 
nucleotides 1494- 14% in SEQ ID N():3l] and the corresponding amino acid sequence is listed 
in SEQ ID NO:26 (/.c\. the same amino acid sequence as that produced by pHisBotA 
containing synthetic gene sequences). 

b) Comparison Of The Expression And Purification Yields Of 
C botulinum Serotype A C Fragments Derived From Native 
And Synthetic Expression Vectors 

Recombinant plasmids containing either the native or the synthetic (' botulinum 
serotype A C fragment genes were transformed into coli strain B121(DE3) pLysS and 
protein expression was induced in 1 liter shaker flask cultures. Total protein extracts were 
isolated, resolved on SDS-PAGE gels and ('. botulinum C fragment protein was identified by 
Western analysis utilizing a chicken anti-C. botulinum serotype A toxoid antiserum as 
described in Example 22. 
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Briefly. I liter (2XYT + 100 ng/ml ampicillin and 34 Mg/ml chloramphenicol) cultures 
of bacteria harboring either the pHisBotA (synthetic) or pHisBolA (native) plasmids in the 
BI2KDF-3) pl.ysS strain were induced to express recombinant protein by addition of IPTG to 
ImM. C ultures were grown at 30-32°C. IPTG was added when the cell density reached an 
()D,, m , 0.5-1.0 and the induced protein was allowed to accumulate for 3-4 hrs after induction. 

The cells were cooled for 15 min in a ice water bath and then centrifuged for 10 min 
at 5000 rpm in a JA10 rotor (Beckman) at 4°C. The cell pellets were resuspended in a total 
volume of 40 mis IX binding buffer (40 mM imidazole. 0.5 M NaCl. 50 mM NaP0 4 . pH 
X.0). transferred to two 50 ml Oakridge tubes and frozen at -70°C for at least 1 hr. The tubes 
were then thawed and the cells were lyscd by sonication (using four successive 20 second 
bursts) on ice. The suspension was clarified by centrifugation 20-30 min at 9.000 rpm 
(lO.OOO.u) in a JA-17 rotor. The soluble lysate was batch absorbed to 7 ml of a 1:1 slurry of 
NiNTA resin.binding buffer by stirring 2-4 hr at 4°C. The slurry was centrifuged for I min 
at 5W) K in 50 ml tube (I-'alcon). resuspended in 5 mis binding buffer and poured into a 2.5 cm 
diameter column (BioRacI). The column was attached to a UV monitor (ISCO) and the 
column was washed with binding buffer until a baseline was established. Imidazole was 
removed by washing with 50mM NaPOj. 0.3 M NaCl. 10% glycerol. pH 7.0 and bound 
protein was eluted using 50mM NaP() 4 . 0.3 M NaC). 10% glycerol. pH 3.5-4.0. 

The eluied proteins were stored at 4°C. Samples of total, soluble, and eluted proteins 
were resolved by SOS-PAGE. Protein samples were prepared for electrophoresis by mixing 
I Ml uual (T) or soluble (S) protein with 4 ul PBS and 5 ul 2X SOS-PAGE sample buffer, or 
5 ul eluted <E) protein and 5 M l 2X SDS-PAGL sample buffer. The samples were heated to 
95T for 5 min. then cooled and 5 or 10 fi\s were loaded on 12.5% SDS-PAGE gels. Broad 
range molecular weight protein markers (BioRad) were also loaded to allow the MW of the 
identified fusion proteins to be estimated. After electrophoresis, protein was detected either 
generally by staining gels with Coomassie blue, or specifically, by blotting to nitrocellulose 
lor Western blot detection of specific immunoreactive protein. 

I or Western blol analysis, the gels were blotted, and protein transfer was confirmed by 
Ponceau S staining as described in Example 22. After blocking the blots for I hr at room 
temperature in blocking buffer (PBST and 5% milk). 10 ml of a 1/500 dilution of an anti-C. 
hmulinum toxin A IgY PRO prep (Ex. 3) in blocking buffer was added and the blots were 
incubated for an additional hour at room temperature. The blots were washed and developed 
using a rabbit anti-chicken alkaline phosphatase conjugate (Boehringer Mannheim) as the 
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secondary antibody as described in Ex. 22. This analysis detected C hcuulinum toxin A- 
.eactivc proteins in the pHisBotA (native and synthetic) protein samples (corresponding to the 
predicted full length proteins identified by Coomassie staining). 

A gel containing proteins expressed from the pHisBot and pHisBot (native) constructs 
during various stages of purification and stained with Coomassie blue is shown in Figure 31. 
In Figure 31. lanes 1-4 and 9 contain proteins expressed by the pHisBotA construct (La., the 
synthetic gene) and lanes 5-8 contain proteins expressed by the pHisBotA (native) construct. 
Lanes I and 5 contain total protein extracts; lanes 2 and 6 contain soluble protein extracts; 
lanes 3 and 7 contain proteins which flowed through the NiNTA columns: lanes 4. 8 and 9 
contain protein elated from the NiNTA columns and lane 10 contains molecular weight 
markers. 

The above purification resulted in a yield of 3 mg (native gene) or 1 1 mg (synthetic 
gene) of affinity purified protein from a I liter starting culture, of which at least 90-95% of 
the protein was a single band of the predicted MW (50kd) and immunoreactivity for 
recombinant C homlhmm serotype A C fragment protein. Other than the level of expression, 
no difference was observed between the native and the synthetic gene expression systems. 

These results demonstrate that soluble C. Inuulwum serotype A C fragment protein can 
be expressed in E. col, and purified utilizing either native or synthetic gene sequences. 

EXAMPLE 29 

Generation Of Neutralizing Antibodies Using A Recombinant 
C hotulinum Serotype A C Fragment Protein Containing A Six Residue His-Tag 

In Example 27. neutralizing antibodies were generated utilizing the pHisBotA protein, 
which contains a histidinc-tagged N-terminal extension comprising 10 histidine residues. To 
determine if the generation of neutralizing antibodies is dependent on the presence of this 
particular his-tag. a protein containing a shorter N-terminal extension (comprising 6 histidine 
residues) was produced and tested for the ability to generate neutralizing antibodies. This 
example involved a) the cloning and expression of the p6HisBo.A(syn) protein and b) the 
generation and characterization of hyperimmune scrum. 
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a) Cloning And Expression Of The p6HisBotA(syn) Protein 

The pfiHisBotA(syn) construct was generated as described below: the term "syn" 
designates the presence of synthetic gene sequences. This construct expresses the C frgament 
of the C. hoiulinum serotype A toxin with a histidine-tagged N terminal extension having the 
following sequence: MetHisHisHisHisHisHisMetAJa (SEQ ID NO:32): the amino acids " 
encoded by the botulinal C fragment gene are underlined and the vector encoded amino acids 
are presented in plain type. 

6XHis oligonucleotides (5*-TATGCATCACCATCACCATCA-3* (SEQ ID NO:33) and 
5 -CATGTGATGGTGATGGTGATGCA-3- (SEQ ID NO:34) were annealed as follows. One 
microgram of each oligonucleotide was mixed in total of 20 ul IX reaction buffer 2 (NEB) 
and the mixture was healed at 70°C for 5 min and then incubated at 42°C for 5 min. The 
annealed oligonucleotides were then iigated with gel purified MeVHindlU cleaved pET23b 
(T7 promoter) or pET21b (T7lac promoter) DNA and the gel purified Nco\/flindU\ C. 
hoiulinum serotype A C fragment synthetic gene fragment derived from pAlterBot (Ex. 22). 
Recombinant clones were isolated and confirmed by restriction digestion. The DNA sequence 
encoding the 6X his-tagged BotA protein contained within p6HisBotA(syn) is listed in SEO 
ID NO:35. The amino acid sequence of the P 6XHisBotA protein is listed in SEQ ID NO:36. 

The resulting recombinant p6XIiisBolA plasmid was transformed into the BI.2KDE3) 
pLysS strain, and 1 liter cultures were grown, induced and harvested as described in Example 
28. Mis-lagged protein was purified as described in Example 28. with the following 
modifications. The binding buffer (BB) contained 5 mM imidazole rather than 40 mM 
imidazole and Nl>40 was added to the soluble lysate to a final concentration of 0.1%. The 
bound material was washed on the column with BB until the baseline was established, then 
the column was washed successively with BB+20 mM imidazole and BB+40 mM imidazole. 
The column was eluted as described in Example 28. 

In the case of the pET23-derived expression system, high level expression of insoluble 
6HisBotA protein was induced. The pET2 1 -derived vector expressed lower levels of soluble 
protein that bound the NiNTA resin and eluted in the 40 mM imidazole wash rather than 
during the low pH elulion. These results [i.e.. low level expression of a soluble protein) are 
consistent with the results obtained with pHisBotA protein (Ex. 25): the pHisBotA construct, 
like the pET2 1 -derived vector, contains the T7lac rather than T7 promoter. 

The 6HisBotA protein thus clutes under less stringent conditions than the 10X 
histidine-coniaining pHisBot protein < 100-200 mM imidazole: Ex. 25) presumably due to the 
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reduction in ihe length of the his-tag. The eluted protein was of the predicted size [i.e., 
slightly reduced in comparison to pHisBolA protein]. 

b) Generation And Characterization Of Hyperimmune Scrum 

Eight BALBc mice were immunized with purified 6HisBolA protein using Gerbu 
GMDP adjuvant (CC Biotech). The 40 mM imidazole eluiion was mixed with Gerbu 
adjuvant and used to immunize mice. Each mouse received a subcutaneous injection of 100 
Ml antigen/adjuvant mix (12 M g antigen + 1 M g adjuvant) on day 0. Mice were subcutancously 
boosted as above on day 14 and bled on day 28. Control mice received pUisBotB protein 
(prepared as described in Ex. 35 below) in Gerbu adjuvant. 

Anti-C. hotuiinum serotype A toxoid titers were determined in serum from individual 
mice from each group using the ELISA described in Example 23a with the exception that the 
initial testing serum dilution was 1:100 in blocking buffer containing 0.5% Tween 20. 
followed by serial 5-fold dilutions into this buffer. The results of the EI. ISA demonstrated 
b that seroconversion (relative to control mice) occurred in all 8 mice. 

Ihe ability of the anti-C hotuiinum serotype A C fragment antibodies present in serum 
from the immunized mice to neutralize native C hotuiinum type A toxin was tested using the 
mouse neutralization assay described in Example 23b. The amount of neutralizing antibodies 
present in the serum of the immunized mice was determined using serum antibody titrations. 
20 The various serum dilutions (0.01 ml) were mixed with 5 LD<„ units of (\ hotuiinum type A 
toxin and the mixtures were injected IP into mice. The neutralizations were performed in 
duplicate. The mice were then observed for signs of botulism for 4 days. Undiluted serum 
was found to protect 100% of the injected mice while the 1:10 diluted serum did not. This 
corresponds to a neutralization titer of 0.05-0.5 lU/ml. 
25 I,lcse results demonstrate that neutralizing antibodies were induced when the 

6HisBotA protein was utilized as the immunogen. Furthermore, these results demonstrate that 
seroconversion and the generation of neutralizing antibodies does not depend on the specific 
N terminal extension present on the recombinant C. hotuiinum type A C fragment proteins. 
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EXAMPLE 30 

Construction Of Vectors For The Expression Of His-Tagged 
C. hotulinum Type A Toxin C Fragment Protein Using the Synthetic Gene 

A number of expression vectors were constructed which contained the synthetic C. 
hmdimnv type A toxin C fragment gene. These constructs vary as to the promoter (T7 or 
T7lac) and repressor elements (laclq) present on the plasmid. The T7 promoter is a stronger 
promoter than is the T7lac promoter. The various constructs provide varying expression 
levels and varying levels of plasmid stability. This example involved a) the construction of 
expression vectors containing the synthetic C. bo.ulinum type A C fragment gene and b) the 
determination of the expression level achieved using plasmids containing either the kanamycin 
resistance or the ampiciiiin resistance genes in small scale cultures. 

a) Construction Of Expression Vectors Containin R The 
Synthetic C. botutinum Type A C Fragment Gene 

lixpression vectors containing the synthetic ('. hondinum type A C fragment gene were 
engineered to utilize the kanamycin resistance rather than the ampiciiiin resistance gene. This 
was done for several reasons including concerns regarding the presence of residual ampiciiiin 
in recombinant protein derived from plasmids containing the ampiciiiin resistance gene. In 
addition, ampiciiiin resistant plasmids are more difficult to maintain in culture: the \V 
lactamase secreted by cells containing ampiciiiin resistant plasmids rapidly degrades 
extracellular ampiciiiin. allowing the growth of piasmid-negalive cells. 

A second altered feature of the expression vectors is the inclusion of laclq gene in the 
plasmid. This repressor lowers expression from lac regulated promoters (,he chromosomally 
located, lactose regulated T7 polymerase gene and the plasmid located T7lac promoter). This 
down regulates uninduced protein expression and can enhance the stability of recombinant cell 
lines. The final alteration to the vectors is the inclusion of either the T7 or T7lac promoters 
that drive high or moderate level expression of recombinant protein, respectively. 

The expression plasmids were constructed as follows. In all cases, the protein 
expressed is the P HisBotA(svn) protein previously described, and the only differences between 
constructs is the alteration of the various regulatory elements described above. 



- 177 - 



WO 98/08540 



PCT/US97/15394 



i) Construction Of pHisBotA(syn) kan T7lac 

The pHisBotA(syn) kan T7lac construct was made by inserting the Sapl/Xho] fragment 
containing the C\ hotulinum type A C fragment from pHisBotA(syn) into pET24 digested with 
Sapl/Xhol (Novagcn: fragment contains kan gene and origin of replication). The desired 
construct was selected for kanamycin resistance and confirmed by restriction digestion. 

ii) Construction Of pllisBotA(svn) kan laclq T7lac 

I he pHisBotA(syn) kan laclq T7lac construct was made by inserting the A7?al///mdIII 
fragment containing the ('. hotulinum type A C fragment from pl!isBotA(syn)kanT7lac into 
the P F.T24a vector digested with XbaVHindUL The resulting construct was confirmed by 
restriction digestion. 

iii) Construction Of pHisBotA(syn) kan laclq T7 

I he plIisBotA(syn) kan laclq T7 construct was made by inserting the A7wI////ndIII 
fragment containing the C. hotulinum type A C fragment from pIIisBotA(syn) kan laclq T71ac 
; nto A7wl///wdlll-diecsicd pHisBotB(syn) kan laclq T7 (described in En 37c below). The 
• suiting construct was confirmed by restriction digestion. 



i 

re 



h) Determination Of The Expression Level Achieved Using 

Plasmids Containing Either The Kanamycin Resistance Or 
The Ampicillin Resistance Genes In Small Scale Cultures 

One liter cultures of pHisBotA(syn) kan T7lac/BI2l(DE3)pLysS and pHisBotA(syn) 
amp T7Iae/B121(DE3)pLysS [this is the previously designated pi lisBotA(syn) construct) were 
grown, induced and his-tagged proteins were purified as described in Example 28. No 
differences in yield or protein integrity/purity were observed. 

These results demonstrate that the antigen induction levels from expression constructs 
were not affected by the choice of ampicillin versus kanamycin antibiotic resistance genes. 
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EXAMPLE 31 

Fermentation Of Cells Expressing Recombinant Botulinal Proteins 

a) Fermentation Culture Of Cells Expressing Recombinant 
Botulinal Proteins 

Fermentation cultures were grown under the following conditions which were 
optimized for growth of the BL21(DE3) strains containing pFT derived expression vectors. 
An overnight 1 liter feeder culture was prepared by inoculating of I liter media (in a 2L 
shaker llask) with a fresh colony grown on an LB kan plate. The feeder culture contained: 
600 mis nitrogen source [20 gm yeast extract (BBL) and 40 gm tryptone (BBL)/600 mis]. 200 
mis 5X fermentation salts (per liter: 48.5 gm K : HP0 4 . 12 gm NalKPCVHX*. 5 gm NH 4 C1. 
2.5 gm NaCI). 180 mis dH : 0. 20 mis 20% glucose. 2 mis 1 M MgS0 4 , 5 mis 0.05M CaCK 
and 4 mis of a 10 mg/ml kanamycin stock. All solutions were sterilized by autoclaving. 
except the kanamycin slock which was filter sterilized. 

An aliquot (5 ml) of the feeder culture broth was removed prior to inoculation, and 
grown for 2 days at 37°C as a culture broth sterility control. Growth was not observed in this 
control culture in any of the fermentations performed. 

The inoculated feeder culture was grown for 12-15 hrs (ON) at 3()-37°C. Care was 
taken to prevent ovcrsaturation of this culture. The saturated feeder culture was added to 10L 
of fermentation media in fermenter (BiofloIV. New Brunswick Scientific. Edison. N.I) as 
follows. The fermenter was sterilized 120 min at I21°C with dH 2 0. The sterile water was 
removed, and fermentation media added as follows: 6 liters nitrogen source, 2 liters 5X 
fermentation salts. 2 liters 2% glucose. 20 mis 1 M MgS0 4 . 50 mis 0.05 M CaCk 2.5-3.5 
mis Macol P 400 ant i foam (PPG Industries Inc.. Gurnee. IL), 40 mis lOmg/ml kanamycin and 
10 mis trace elements (8 gm FeSO/7H,0. 2 gm MnSO,-H : (). 2 gm AICI : *6H,0. 0.8 gm 
CoC>6H : 0. 0.4 gm ZnSO/7H ? 0. 0.4 gm Na : MoO/2H : 0. 0.2 gm CuCh*2HX>. 0.2 gm 
NiCk 0.1 gm H,BO./200mls 5 M HCI). All solutions were sterilized by autoclaving. except 
the kanamycin stock which was filter sterilized. Fermentation media was prewarmed to 37°C 
before the addition of the feeder culture. 

After the addition of the feeder culture, the culture was fermented at 37°C. 400 rpm 
agitation, and 10 I/min air sparging. The DO : control was set to 20% PID and dissolved 
oxygen levels were controlled by increasing the rate of agitation from 400-850 rpm under 
IX), control. DO, levels were maintained at greater than or equal to 20% throughout the 
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entire fermentation. When agitation levels reached 500-600 rpm the temperature was lowered 
to 30°C to reduce the oxygen consumption rate. Culture growth was continued until 
endogenous carbon sources were depleted. In these fermentations, glucose was depleted first 
[monitored with a glucose monitoring kit (Sigma)], followed by assimilation of acetate and 
other acidic carbons [monitored using an acetate test kit (Boehringcr Mannheim)]. During the 
assimilation phase, the pH rose from 6.6-6.8 (starting P H) lu 7.4-7.5. at which time the bulk 
of the remaining carbon source was depleted. This was signaled by a drop in agitation rate 
(from a maximum of 700-800 rpm) and a rise in DO, levels >3()%. This corresponds to a 
OD w „, reading of 18-20/ml. At this point a fed batch mode was initiated, in which a feed 
solution of 50% glucose was added at a rate of approximately 4 gm glucose/liter/hr. The pH 
was adjusted to 7.0 by the addition of 25% H } PO, (approximately 60 mis). Culture growth 
was continued and reached peak oxygen consumption within the next 3 hrs of growth (while 
the remaining residual non-glucose carbon sources were assimilated). This phase is 
characterized by a slow increase in pi I. and air sparging was increased to l5L/min. to keep 
the maximum rpm below 850. Once the residual acidic carbon sources are depleted the 
agitation rate decreases to 650-750 rpm and the pH begins to drop. pH control was 
maintained ai 7.0 PID by regulated pump addition of a sterile 4M NaOH solution which was 
consumed at a steady rate for the remainder of the fermentation. Growth was continued at 
30°C. and the cultures were grown linearly at a growth rate of 4-7 OD h „„ units/hr. to at least 
81.5 OD,,,,,, units/ml (>30g/l dry cell weight) without induction. Antifoam (a 1:1 dilution with 
filter sterilized 100% ethanol) was added as necessary throughout the fermentation to prevent 
foaming. 

During the fed batch mode, glucose was assimilated immediately (concentration in 
media consistently less than 0.1 gm/liter) and acetate was not produced in significant levels by 
the pHT plasmid/BL21(DE3) cell lines tested (approximately 1 gm/liter at end of 
fermentation: this is lower than that observed in harvests from shaker flask cultures utilizing 
the same strains). This was fortuitous, since high levels of acetate has been shown to inhibit 
induction levels in a variety of expression systems. The above described conditions were 
found to be highly reproducible between fermentations and utilizing different expression 
plasmids. As a result, glucose and acetate level monitoring were no longer preformed during 
fermentation. 
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b) Induction Of Fermentation Cultures 

Induction with IPTG (250 mg-10 gms, depending on the expression vector and 
experiment) was initiated 1-3 hrs after initiation of the glucose feed (30-50 OD^/ml). The 
growth rate after induction was monitored on a hourly basis. Aliquots (5-10 ml) of ceils were 
harvested at the time of induction, and at hourly intervals post-induction. Optical density 
readings were determined by measuring the absorbance at 600 nm of 10 jil culture in 990 jil 
PBS versus a PBS control. The growth rate after induction was found to vary dependinu on 
the expression system utilized. 

c) Monitoring Of Fermentation Cultures 

Fermentation cultures were monitored using the following control assays. 

i) Colony Forming Ability 

An aliquots of cells were removed from the cultures at each timepoint sampled 
(uninduced and at various times after induction) were serially diluted in PBS (dilution 1 = 15 
Ml cells/3 ml PBS. dilution 2 - I 5 of dilution 1/3 ml PBS, dilution 3 - 3 or 6 |il of 
dilution 2/3mls PBS) and 100 |al of dilution 3 was plated on an LB or TSA (trypticase soy 
agar) plate. The plates were incubated ON at 37°C and then the colonies are counted and * 
scored for macro or micro growth. 

ii) Phenotypic Characterization 

Colonies growing on LB or TSA plates (above) from uninduced and induced 
timepoints were replica plated onto LB+kan. LB+chloramphenicol (for fermentations utilizing 
LysS or pACYCGro plasmids). LB+kan+lmM IPTG and LB plates, in this order. The plates 
were grown 6-8 hrs at 37°C and growth was scored on each plate for a minimum of 40-50 
well isolated colonies. The percentage of cells retaining the plasmid at time of induction (i.e.. 
uninduced cultures immediately prior to the addition of IPTG) was determined to be the # 
colonies LB+Kan (or chloramphenicol) plate/* colonics LB plate X 100%. The percentage of 
cells with mutated pET plasmids was determined to be the tt colonies LB+Kan+IPTG piate/# 
colonies LB plate X 100%. Colonies on all LB plates were scored morphologically for E. 
coli phenotype as a contamination control. Morphologically detectable contaminant colonies 
were not detected in any fermentation. 
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iii) Recombinant BotA Protein Induction 

A total of 10 OD M units of cells (e. K .. 200 ul of cells at OD M ,„-50/ml) were removed 
from each timepoint sample to a 1.5 ml microfuge lube and pelleted for 2 min at maximum 
rpm in a microfuge. The pellets were resuspended in I ml of 50 mM NallPO,. 0.5 M NaCI. 
40mM imidazole buffer ( P H 6.8) containing I mg/ml lysozyme. The samples were incubated 
for 20 min a. room temperature and stored ON at -7()°C. Samples were thawed completely at 
room temperature and sonicated 2X10 seconds with a Branson Sonifier 450* micro.ip probe 
at # 3 power setting. The samples were centrifuged for 5 min at maximum rpm in a 
microfuge. 

An aliquot (20 u.1) of the protein samples were removed lo 20 M | 2X sample buffer, 
before or after centrifugation. for total and soluble protein extracts, respectively. The samples 
were heated to 95°C for 5 min. then cooled and 5 or 10 ul were loaded onto 12.5% 
SDS-PAGE gels. High molecular weight protein markers (BioRad) were also loaded to allow 
tor estimation of the MW of identified fusion proteins. After electrophoresis, protein was 
detected either generally by staining gels with Coomassie blue, or specifically, by blotting 
onto nitrocellulose (as described in lix. 28) for Western blot detection of specific his-tagged 
proteins utilizing a NiNTA-alkalinc phosphatase conjugate exactly as described by the * 
manufacturer (Oiagcn). 



iv) Recombinant Antigen Purification 

At the end of each fermentation run. 1-10 liters of culture were harvested from the 
lermentcr and the bacterial cells were pelleted by centrifugation at 6000 rpm for 10 min in a 
M10 rotor (Beckman). The cell pellets were stored frozen at -70°C or utilized immediately 
without freezing. Cell pellets were resuspended to 15-20% weight to volume in ^suspension 
buffer (generally 50 mM NaPO,. 0.5 M NaCI. 40mM imidazole. pH 6.8) and lysed utilizing 
either sonication or high pressure homogenization. 

l or sonication. the resuspension buffer was supplemented with lysozyme to 1 mg/ml. 
and the suspension was incubated for 20 min. at room temp. The sample was then frozen ON 
ai -70°C. thawed and sonicated 4 X 20 seconds at microtip maximum to reduce viscosity. 
l ; or homogenization. the cells were lyzed by 2 passes through a homogenizer (Rannie 
Mini-lab type 8.30 H) at 600 Bar. Cell lysates were clarified by centrifugation lor 30 min at 
10.000 rpm in a JA 10 rotor. 
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For IDA chromatography, samples were flocculated utilizing polyethyleneirnine (PEI) 
prior to centrifugation. Cell pellets were resuspended in cell resuspension buffer (CRB: 50 
mM NaP0 4 . 0.5 M NaCI. 40 mM imidazole, pH 6.8) to create a 20% cell suspension (wet 
weight of cells/volume of CRB) and cell lysates were prepared as described above (sonication 
or homogenization). PEI (a 2% solution in dHA pH 7.5 with HCI) was added to the cell 
lysate a final concentration of 0.2%. and stirred for 20 min at room temperature prior to 
centrifugation (8.500 rpm in JAM rotor for 30 minutes at 4°C). This treatment removed 
RNA. DNA and cell wall components, resulting in a clarified, low viscosity lysate ("PHI 
clarified lysaie"). 

His-tagged proteins were purified from soluble lysates by metal-chelatc affinity 
chromatography using either a NiNTA resin (as described in Ex. 28) or an IDA (iminodiacetic 
acid) resin as described below. 

IDA resin affinity purifications were performed utilizing a low pressure 
chromatography system (1SCO). A 7 ml (small scale) or 70 ml (large scale) Chelating 
Sepharose Fast Flow (Pharmacia) affinity column was poured; in addition, a second guard 
column was poured and attached in line with the first column (to capture Ni ions that leached 
off the affinity column). The columns were washed with 3 column volumes of dI KO. The 
guard column was then removed and the affinity column was washed with 0.3 M NiSO, until 
resistivity was established, then with dH : 0 until the resistivity returned to baseline. The 
columns were reconnected and equilibrated with cell resuspension buffer (CRB; 50 mM 
NaP() 4 . 0.5 M NaCI. 40 mM imidazole. pH 6.8). The clarified sample (in CRB) was loaded. 
How rates were 5 ml/min for small scale columns and 20 ml/min for large scale columns. 
Alter sample loading, the column was washed with CRB until a baseline established and 
bound protein was eluted with clution buffer (50 mM NaPO„ 0.5 M NaCI, 800 mM 
imidazole. 20% glycerol. pH 6.8 or 8.0). Protein samples were stored at 4°C or -20°C. The 
yield of eluted protein was established by measuring the OD :m of the elutions. with a 1 mg/ml 
solution of protein assumed to yield an absorbance reading of 2.0. 

The IDA columns may be regenerated and reused multiple times (>10). To regenerate 
the column, the column was washed with 2-3 column volumes of H : 0. then 0.05 M FDTA 
until all of the blue/green color was removed followed by a wash with dHX). The IDA 
columns were sterilized with 0.1 M NaOH (using at least 3 column volumes but not more 
than 50 minutes contact time with column packing material), then washed with 3 column 
volumes 0.05 M NaPO,. pH 5.0. then dIKO and stored at room temperature in 20 % ethanol. 



- 183 - 



WO 98/08540 



PCT/US97/15394 



10 



20 



25 



30 



EXAMPLE 32 

Construction Of A Folding Chaperonc Overexpression System 

Co-ovcrexpression of the E. coli GroEL/GroES folding chaperones in a cell expressing 
a recombinant foreign protein has been reported to enhance the solubility of some foreign 
proteins that are otherwise insoluble when expressed in E. coli [Gragerouu et ul. (1992) Proc. 
Natl. Acad. Sci. USA 89:10344]. The improvement in solubility is (bought to be due to 
chaperone-mediated binding and unfolding of insoluble denatured proteins, thus allowing 
multiple attempts for productive refolding of recombinant proteins. By overcxpressing the 
chaperones. the unfolding/refolding reaction is driven by excess chapcrone. resulting, in sonic 
cases, in higher yields of soluble protein. 

In this example, a chaperone overexpression system, compatible with pET vector 
expression systems, was constructed to facilitate testing chaperone-mediated solubilization of 
C. hoiulimim lype A proteins. This example involved the cloning of ihe GioEL/ES operon 
and construction of a pLysS-based chaperone hypercxpression system. 

I he GroEL/GroES operon was PGR amplified and cloned into the pC'KScript vector as 
described in Example 28. The following primer pair was used: 5-CGCAT 
ATGAAT ATTCGTCCATTGCATG-3' (SEQ ID NO:37) |5" primer, start codon of groES 
gene converted to NJc\ site (underlined)] and 5 ' -GG A AGCTT GC AGGGr A A r 7>fCATCATG 
(SEQ ID NO:38) (3' primer, stop codon of groEL gene italicized, engineered //mdlll site 
underlined), following amplification, the chaperone operon was excised as an AV/el/V/mdlll 
fragment and cloned into pET23b digested with yVc/d and //mdlll. This construction places 
the Gro operon under the control of the T7 promoter of the P ET23 vector. The desired 
construct was confirmed by restriction digestion. 

The T7 promotcr-Gro operon-T7 terminator expression cassette was then excised as a 
BttlWIJspU (filled) fragment and cloned into BctmHl (compatible with /^/ll)////,jdlll (filled) 
cleaved pLysS plasmid (this removed the T7 lysozyme gene). The resulting construct was 
designated pACYCGro. since the plasmid utilizing the pACYC184 origin from the plysS 
plasmid. Proper construction was confirmed by restriction digestion. 

pACYCGro was transformed into BL2KDE3). cultures were grown and induced with 
1 mM IPTG as described in preceding examples. Total and soluble protein extracts were 
generated from cells removed before and after IPTG induction and were resolved on a 12.5 % 
SDS-PAGE gel and stained with Coomassie blue. This analysis revealed that high levels of 
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soluble GroEl and GroES proteins were made in the induced cells. These results 
demonstrated that the chaperone hypcr-cxpression system was functional. 

EXAMPLE 33 

Growth Of BotA/pACYCGro Cell Lines In Fermentation Cultures 

Induction of BL2HDE3) cells lacking the LysS plasmid which contained BotA 
expression constructs grown in shaker tlask or fermentation culture resulted in the expression 
of primarily insoluble BotA protein. Fermentation cultures were performed to determine if 
the simultaneous overexpression of the Gro operon and recombinant C. hotui'mum type A 
proteins (BotA proteins) resulted in enhanced solubility of the recombinant BotA protein. 
This example involved the fermentation of pHisBotA(syn)kan laclq I71ac/pACYCGro 
BL2KDE3) and pHisBotA(syn)kan laclq T7/pACYCGro BL21(OE3) cell lines. The 
fermentations were repeated exactly as described in Example 31. Chloramphenicol (34 
^tg/ml) was included in the feeder and fermentation cultures. 

a) Fermentation Of pHisBotA(syn)kan laclq IZiac/pACYCGro 
BL2I(I)E3) Cells 

For fermentation of cells containing plasmids comprising the T7lac promoter, 
induction was with 2 gms IPTG at 1 hr post initiation of glucose feed. The OD„ )0 was 35 at 
lime of induction, then 48.5. 61.5. 67 at 1-3 hrs post induction. Viable colony counts 
decreased from 0-3 hr induction [21 (13), 0, 0. 0: dilution 3 utilized 3 M l of dilution 2 cells] 
with numbers in parenthesis for the indicating microcolonies. Of 28 colonies scored at the 
lime of induction. 23 retained the pi iisBotA(syn)kan laclq T7lac plasmid (kan resistant). 22 
contained the chaperone plasmid (chloramphenicol resistant) and no colonies at induction 
grew on MTG+Kan plates (no mutations delected). These results were indicative of very 
strong promoter induction, since colony viability dropped immediately after induction. 

Total and soluble extracts were resolved on a 12.5% SDS-PAGE gel and stained with 
Coomassie. High level induction of Gro chaperones was observed, but very low level 
expression of soluble BotA protein was observed, increasing from I to 4.0 hrs post induction 
(no expression detected in uninduced cells). The dramatically lower expression of the BotA 
antigen in the presence of chaperone may be due to promoter occlusion {i.e.. the stronger T7 
promoter on the chaperone plasmid is preferentially utilized). 
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b) Fermentation Of pHisBotA(syn)kan laclq Til pACYCGro 
BL21(DE3) Cells 

A fermentation utilizing the T7-driven BotA expression plasmid was performed. 
Induction was with 1 g m ip TG at 2 hrs post initiation of glucose feed. The OD M „,was 41 at 
time of induction, then 51.5. 61.5. 61.5 and 66 at 1-4 hrs post induction. Viable colonv 
counts decreased from 0-4 hrs induction [71. 1 (34). 1 ( I ). 1.0: dilution 3 utilized 6 u| 
dilution 2 cells) with numbers in parenthesis for the uninduccd timepoint indicating 
microsomes. Of 65 colonies scored at the time of induction, all 65 retained both the 
P HisBotA(syn)kan laclq T7 plasmid (kan resistant) and the chaperonc plasmid 
(chloramphenicol resistant) and no colonies at induction grew on IPTG+Kan plates (no 
mutations delected). 

Total and soluble extracts were resolved on a 12.5% SDS-PAGF gel and stained with 
Coomassie. High level induction of Oro chaperones and moderate level expression of soluble 
BotA protein was observed, increasing from I to 4.0 hrs post induction (no expression 
detected in uninduced cells). 

A PLl-clarified lysate (0.2% final cocnentration PF.I) [850 ml from 130 gm cell pelle. 
(2 liters fermentation harvest) | was purified on a large scale IDA column. A total of 78 mg 
of protein was cluted. Fxtracts from the purification were resolved on a 12.5% SDS-PAGF 
gel and stained with C oomassie. The elution was found to contain an approximately 1:1 mix 
of BoiA/chaperone protein (Figure 32). PEI lysales prepared in this manner were typically 16 
OD :S( /ml. This was estimated to be 8 mg protein/ml of lysate (by BCA assay). Thus, the 
cluted recombinant BotA protein represented 0.55% of the total soluble cellular protein 
applied to the column. 

In Figure 32. lane I contains molecular weight markers, lanes 2-9 contain extracts 
from P llisBotA(syn)kan laclq T7/pACYCGro/BL2 1 ( DF3 ) cells before or during purification 
on the IDA column. Lane 2 contains total protein extract: lane 3 contains soluble protein 
extract: lanes 4 and 5 contain PFI-clarifled lysates (duplicates): lanes 6 and 7 contain flow- 
through from the IDA column (duplicates) and lanes 8 and 9 eonlain IDA co|umn dulc ( , ane 
l > contains 1/10 the amount applied to lane 8). 

These results demonstrate, that although the majority of the BotA protein produced 
was insoluble. 20 mg/liter of soluble recombinant BotA protein can be purified utilizing the 
pHisBotA(syn)kan laclq IZ'pACYa»ro/BL2l(l)E3) expression system. 
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EXAMPLE 34 

Purification Of Recombinant BotA Protein From Folding Chaperones 

In this example of size exclusion chromatography was used to purify the recombinant 
BotA protein away from the folding chaperones and imidazole present in the IDA-purified 
material (Ex. 33). 

To enhance the solubility of the recombinant BotA protein during scale-up. the protein 
was co-expressed with folding chaperones (Ex. 33). As observed with the recombinant BotB 
protein (Example 40 below), the folding chaperones co-eluted with the recombinant BotA 
protein during the Ni-IDA purification step. Because the recombinant BotA and BotB 
proteins have similar molecular weights (about 1/10 the size of the non-reduced folding 
chaperone) and the imidazole step gradient strategy was unsuccessful in purifying BotB away 
from ihc folding chaperone (see Ex. 40). size exclusion chromatography was examined for the 
ability to purify the recombinant BotA protein away from the folding chaperones. 

A column (2.5 x 24 cm) containing Sephacryl S-100 HR (Pharmacia) was poured (bed 
volume I 10 ml). Proteins having molecular weights greater than 100 K are expected to elute 
in the void volume under these conditions and smaller proteins should be retained by the 
beads and elute at different times, depending on their molecular weights. To maintain 
solubility of the purified BotA protein, the Sephacryl column was equilibrated in a buffer 
having the same salt concentration as the buffer used to elute the BotA protein from the IDA 
column (i.e.. 50 mM sodium phosphate. 0.5 M NaCI. 10% glycerol; all reagents from 
Mallinkrodl. Chesterfield. MO). 

l ive milliliters of the IDA-purified recombinant BotA protein (Ex. 33) was filtered 
through a 0.45 M syringe filter, applied to the column and the equilibration buffer was 
pumped through the column at a How rate of 1 ml/minute. Eluted proteins were monitored 
by absorbance at 280 nm and collected either manually or with a fraction collector (BioRad). 
Appropriate fractions were pooled, if necessary, and the protein was quantitated by absorbance 
at 280 nm and/or BCA protein assay (Pierce). The isolated peaks were then analyzed by 
native and/or SDS-PAGE to identify the proteins present and to evaluate purity. The folding 
chaperone eluted first, followed by the recombinant BotA protein and then the imidazole 
peak. 

SDS-PAGE analysis (12.5% polyacrylamide. reduced samples) was used to evaluate 
ihc purity of the IDA-purilied recombinant BotA protein before and after S-100 purification. 



- 187 



WO 98/08540 



PCT/US97/15394 



Figure 33 shows the difference in purity before and after the S-100 purification step. In 
Figure 33. lane I contains molecular weight markers (BioRad broad range). Lane 2 shows 
the IDA-purified recombinant BotA protein preparation, which is contaminated with 
significant amounts of the folding chaperone. Following S-100 purification, the amount of 
folding chaperone present in the BotA sample is reduced dramatically (lane 3). Lane 4 
contains no protein {i.e.. it is a blank lane): lanes 5-8 contain samples of IDA-purified 
recombinant BotB and BotE proteins and are discussed infra. 

Endotoxin levels in the S-100 purified BotA preparation were determined using the 
LAE assay (Associates of Cape Cod) as describe in Example 24. The purified BotA 
preparation was found to contain 22.7 to 45.5 EU/mg recombinant protein. 

These results demonstrate that size exclusion chromatography was successful in 
purifying the recombinant BotA protein from folding chaperones and imidazole following an 
initial IDA purification step. Furthermore, these results demonstrate thai the S-100 purified 
BotA protein was substantially free of endotoxin. 

EXAMPLE 35 

Cloning And Expression Of The C Fragment 
Of The (.'. lumdinum Serotype B Toxin Ciene 

I he ( . hoiulinum type B neurotoxin gene has been cloned and sequenced | Whelan ci 
ul. (1992) Appl. Environ. Microbiol. 58:2345 and Hutson ei ul. (1994) Curr. Microbiol. 
28:101]. The nucleotide sequence of the toxin gene derived from the Eklund I7B strain 
(ATCC 25765) is available from the EMBL/GenBank sequence data banks under the 
accession number X7I343: the nucleotide sequence of the coding region is listed in SEQ ID 
NO:39. The amino acid sequence of the C. hoiulinum type B neurotoxin derived from the 
strain Eklund 17B is listed in SEQ ID NO:40. The nucleotide sequence of the ( '. hoiulinum 
serotype B toxin gene derived from the Danish strain is listed in SEQ ID NO:4l and the 
corresponding amino acid sequence is listed in SEQ ID NO:42. 

The DNA sequence encoding the native C. hoiulinum serotype B C fragment gene 
derived from the Eklund I7B strain can be expressed using the pETHisb vector: the resulting 
coding region is listed in SEQ ID NO:43 and the corresponding amino acid sequence is listed 
m SEQ ID NO:44. The DNA sequence encoding the native C. hoiulinum serotype B C 
fragment gene derived from the Danish strain can be expressed using the pETHisb vector; the 
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resulting coding region is listed in SEQ ID NO:45 and the corresponding amino acid sequence 
is listed in SEQ ID NO:46. The C frgament region from any strain of C. botulinum serotype 
B can be amplified and expressed using the approach illustrated below using the C fragment 
derived from C. botulinum type B 2017 strain. 

The ( botulinum type B neurotoxin gene is synthesized as a single polypeptide chain 
which is processed to form a dimer composed of a light and a heavy chain linked via 
disulfide bonds: the type B neurotoxin has been reported to exist as a mixture of predominate 
single chain with some double chain (Whclan el at., supra). The 50 kD carboxy-terminal 
portion of the heavy chain is referred to as the C fragment or the H v domain. Expression of 
the C fragment of C. botulinum type B toxin in heterologous hosts (<.< #.. £ coli) has not been 
previously reported. 

The native C fragment of the C. botulinum serotype B toxin gene was cloned and 
expression constructs were made to facilitate protein expression in f. coli. This example 
involved PCR amplification of the gene, cloning, and construction of expression vectors. 

The C fragment of the C. botulinum serotype B (BotB) toxin gene was cloned using 
the protocols and conditions described in Example 28 for the isolation of the native BotA 
gene. The C. botulinum type B 2017 strain was obtained from the American Type Culture 
Collection (ATCC * 1 7843). The following primer pair was used in amplify the BotB gene: 
5 -C( "CCATJ^iGC'I GA I'ACAATACTAATAGAA ATC.-.V (5" primer, engineered Ncol site 
underlined (SEQ ID NO:47)] and S -CJCAAGXH^TTCAGTCCACCCTTCATC-.V |.V 
primer, engineered /-/mdlll site underlined, native gene termination codon italicized (SEQ ID 
NO:4H)|. Alter cloning into the pCRscript vector, the A7f cl( fl 1 led )////mlI I [ fragment was 
cloned into pETUisb vector as described for BotA C fragment gene in Example 28. The 
resulting construct was termed pllisBotB. 

pHisBotB expresses the BotB gene sequences under the transcriptional control of the 
1*7 lac promoter and the resulting protein contains an N-terminal !0XHis-tag affinity tag. The 
pllisBotB expression construct was transformed into BL2I(DE3) pLysS competent cells and 1 
liter cultures were grown, induced and his-tagged proteins were purified utilizing a NiNTA 
resin (cluted in low pi I cluiion buffer) as described in Example 28. Total, soluble and 
purified proteins were resolved by SDS-I'AGE and detected by Coomassie staining and 
Western blot hybridization utilizing a chicken anli-C. botulinum serotype B toxoid primary 
antibody (generated by immunization of hens using C. botulinum serotype B toxoid as 
described in Example 3). Samples of BotA and BotE C fragment proteins were included on 
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ihe gels ibr MW and immunogenicity comparisons. Strong immunoreactivity to only the 
BotB protein was detected with the ami-C. hotulinum serotype B toxoid antibodies. The 
recombinant BotB protein was expressed at low levels (3 mu/liter) as a soluble protein. The 
purified BotB protein migrated us a single band of the predicted MW (i.e.. ~50kD). 

These results demonstrate the cloning of the native ( hotulinum serotype B C 
fragment gene, the expression and purification of the recombinant BotB protein as a soluble 
his-tugged protein in E. co/i. 



EXAMPLE 36 

Generation Of Neutralizing Antibodies Using The Recombinant pHisBotB Protein 

The ability of the purified pl lisBol protein to generate neutralizing antibodies was 
examined. Nine BALBc mice were immunized with BotB protein (purified as described in 
Kn. 35) using Cierbu CiMDP adjuvant (CC Biotech). The low pi I elulion was mixed with 
(ierbu adjuvant and used to immunize mice. Each mouse received a subcutaneous injection 
of 100 M l antigen/adjuvant mix (15 ug antigen + I ug adjuvant) on day 0. Mice were 
subcutaneously boosted as above on day 14 and bled on day 28. Mice were subsequently 
boosted I-: weeks after bleeding and were then bled on day 70. 

Anti-f. hotulinum serotype B toxoid titers were determined in day 28 serum from 
individual mice from each group using the F.LISA protocol outlined in lixample 29 with the 
exception that the plates were coated with C. hotulinum serotype B toxoid, and the primary 
aniibod> was a chicken anti-C. hotulinum serotype B toxoid. Seroconversion | relative to 
control mice immunized with pHisBotK antigen (described below) | was observed with all 0 
mice immunized with the purified pHisBotB protein. 

The ability of the anti-BotB antibodies to neutralize native C. hotulinum type » toxin 
was tested in a mouse-C. hotulinum neutralization model using pooled mouse serum (see Ex. 
23b). The LD<„ of purified C. hotulinum type B toxin complex (Dr. Liric Johnson. University 
of Wisconsin. Madison) was determined by a intraperitoneal (ll>) method fSchantz and Kautler 
(1978). supro\ using 18-22 g female ICR mice. The amount of neutralizing antibodies present 
in the serum of the immunized mice was determined using serum antibody titrations. The 
various serum dilutions (0.01 ml) were mixed with 5 LD<„ units of C. hotulinum type B toxin 
and the mixtures were injected IP into mice. The neutralizations were performed in duplicate. 
I he mice were then observed for signs of botulism for 4 days. Undiluted serum (dav 28 or 
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day 70) was found to protect 100% of the injected mice while the 1:10 diluted serum did not. 
I his corresponds to a neutralization titer of 0.05-0.5 lU/ml. 

These results demonstrate thai seroconversion occurred and neutralizing antibodi 
were induced when the pilisBotB protein was utilized as the immunouen. 
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EXAMPLE 37 

Construction Of Vectors To Facilitate Impression 
Of His-Taggcd BotB Protein In Fermentation Cultures 

A number of expression vectors were constructed to facilitate the expression of 
recombinant Bo.B protein in large scale fermentation culture. These constructs varied 
«hc strength of the promoter utilized (T7 or T7iac, and ,he presence of repressor elements 
daclq, on the p.asmid. The resulting constructs varied in the .eve, of expression achieved and 
■n plasm.d stability which facilitated the selection of a optimal expression system for 
leimeiuation scaleup. 

The BotB expression vectors created for fermentation culture were ent-ineered to 
utilize the kanamycin rather than the ampicillin resistance gene, and contained either the T7 
Hlac promoter, with or without the laclq gene for the reasons outlined in Example 30. 

In all cases, the protein expressed by the various expression vectors is the pHisBo, B 
P-tem described in Example 35. with the only differences between clones being the alteration 
«« varu>us regulatory elements. Using the designations outlined below. ,he p „i sB otB clone 
(i-.x. 35) is equivalent to pilisBotB amp T7lac. 

a) Constructicin Of pilisBotB kan T7lac 

pHisBotB kan T7lae was constructed by insertion of the ^/II////„dIII fraument of 
Pi lisBotB which contains the BotB gene sequences into the pPA1870-2680 kan T7lac vector 
wluch had been digested with ByM and ///„d.II (the pPA 1870-2680 kan T7lac vector contains 
ihc P hT24 kan gene in the P ET23 vector, such that no lack, gene is present) Proper 
construction of pHisBotB kan T7lac was confirmed by restriction diuestion 
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b) Construction Of pHisBotB kan laclq T7lac 

pHisBotB kan lack] T7Iac was constructed by insertion of the BgMfHindlU fragment 
of pHisBotB which contains the BotB gene sequences into similarly cut pET24a vector. 
Proper construction of pHisBotB kan laclq T7lac was confirmed by restriction diucstion. 

c) Construction Of pH is BotB kan laclq T7 

pHisBotB kan laclq 17 was constructed by inserting the Ndel/Xhol fragment from 
pi lisBotE kan laclq T7lac which contains the BotB gene sequences into similarly cleaved 
pPA 1 870-2680 kan laclq T7 vector (this vector contains the T7 promoter, the same N- 
terminai his-tag as the Bot constructs, the ('. difficile toxin A insert, and the kan laclq genes: 
this cloning replaces the C. difficile toxin A insert with the BotB insert). Proper construction 
was confirmed by restriction digestion. 

Expression of recombinant BotB protein from these expression vectors and purification 
of the BotB protein is described in Example 38 below. 

EXAMPLE 38 

fermentation And Purification Of Recombinant BotB Protein Utilizing The 
pHisBotB kan laclq T71ac. pHisBotB kan T7lac And pHisBotB kan laclq T7 Vectors 

The pHisBotB kan laclq T7lac. pHisBotB kan T7lac and BoiB kan laclq T7 constructs 
jail transformed into the B12WDE3) strain] were grown in fermentation cultures to determine 
the utility of the various constructs for large scale expression and purification of soluble BotB 
protein. All fermentations were performed as described in Example 31. 



a) Fermentation Of pHisBotB kan laclq T71ac/BI21(DE3) Cells 

The fermentation culture was induced 45 min post start of glucose feed with I gm 
IPTG (final concentration = 0.4 mM). pH was maintained at 6.5 rather than 7.0. The OD wjn 
was 27 at lime of induction, then 35. 38. and 40 at 1-3 hrs post induction. Duplicate platings 
of diluted I hr induction samples (dilutions were prepared as described Ex. 31. dilution 3 
utilized 3 \v\ of dilution 2 cells) on TSA and LB Ham plates yielded 80 TSA colonics and 81 
kan colonies (90% kan resistant). 

Total and soluble protein extracts were resolved on a 12.5% SDS-PAGE gel and total 
protein was detected by staining with Coomassic blue. Low level induction of insoluble Bot 



- 192 - 



WO 98/08540 



PCI7US97/15394 



B protein was observed, increasing from i to 3 hrs post induction (no expression was detected 
in uninduced cells). 



b) Fermentation Of pHisBotB kan T7lac/BI21(DE3) Cells 

The fermentation culture was induced I hr post start of glucose feed with 2 gm IPTG 
(l-nal concentration = 0.8 mM). pi I was maintained at 6.5 rather than 7.0. The OD (l(K1 was 
24.5 at time of induction, .hen 31.5. 32. and 33 at 1-3 hrs post induction, respectively.' 
Duplicate platings of diluted 0 hr and 2 hr induction samples (dilutions were prepared as 
described Ex. 31: dilution 3 utilized 3 ul of dilution 2 cells) on I SA and LB + kan plates 
yielded 32 TSA colonies and 54 kan colonies (all kan resistant) for uninduced cells, and 1 
I SA colony and 0 kan colonies 2 hr post induction. These results were indicative of strong 
induction, since viable counts decreased dramatically 2 hrs post induction. 

Total and soluble extracts were resolved on a 10% SOS-PAGE gel and total protein 
was detected by staining u ith Coomassie blue. Moderate induction of insoluble BotB pro.e 
was observed, increasing from I to 3 hrs post induction (no expression was detected in 
uninduced cells). 



in 



c) Fermentation Of pHisBotB kan laclq T7/BI21(DE3) Cells 

The fermentation was induced 2 hr post start of glucose feed with 4 ym IPTG (final 
concentration = 1.6 mM). pll was maintained at 6.5 rather than 7.0. The OD N „, was 45 a. 
lime of induction, then 47. 50. and 50 and 55 a. 1-4 hrs post induction, respectively. Viable 
colony counts decreased after induction (96. 1.1.2. 3: dilution 3 utilized 3 ul of dilution 2 
cells). Of 63 colonies scored a. the time of induction, all 63 retaining the BotB plasmid (kan 
resistant) and no colonies at induction grew on IPTG + Kan plates (no mutations detected). 

Total and soluble extracts were resolved on a 12.5% SDS-PAGIi gel and total protein 
was detected by staining with Coomassie blue. Moderate level induction of insoluble BotU 
protein was observed, increasing from I to 4 hrs post induction (lower level expression was 
detected in uninduced cells, since the T7 rather than T7lac promoter was utilized). 

d) Purification Of pHisBotB Protein From pHisBotB amp 
T7lac/BI21(DE3) Cells 

Soluble recombinant BotB protein was purified utilizing NiNTA resin from 80 ml of 
cell lysate generated from cells harvested from a pHisBotB fermentation fusing the pHisBotB 
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amp T71ac/B121(DE3) sirainl. As predicted from the small scale results above, the majority 
of the induced protein was insoluble. As well, the cluted material was contaminated with 
multiple £ coli contaminant proteins. A Coomassie blue-stained SDS-PAGF gel containing 
extracts derived from pHisBotB amp T7lac/BI21(DF3) cells before and during purification is 
shown in Figure 34. In Figure 34. lane 1 contains broad range protein MW markers 
(BioRad). Lanes 2-5 contain extracts prepared from pHisBotB amp T71ac/BI2I(DI:'3) ceils 
grown in fermentation culture: lane 2 contains total protein: lane 3 contains soluble protein: 
lane 4 contains protein which did not bind to the NiNTA column (i.e.. the (low-through) and 
lane 5 contains protein cluted from the NiNTA column. 

Similar results were obtained using a small scale IDA column utilizing a cell lysate 
from the pHisBotB kan laclq T7 fermentation described above. 250 mis of a 20% w/v PFI 
clarified lysate (50 gms cell pellet) of botB kan laclq T7/BI2HDF.3) cells were purified on a 
small scale IDA column. The total yield of cluted protein was 21 mg protein (assuming I 
mg/ml solution , 2 OD :s ,/ml). When analyzed by SDS-PAGF: and Coomassie staining, the 
BotB protein was found to comprise approximately 50% of the cluted protein with the 
remainder being a ladder of £ coli proteins similar to that observed with the NiNTA 
purification. 

I'he NiNTA alkaline phosphatase conjugate was utilized to delect his-laggcd proteins 
on a Western blot containing total, soluble, soluble (PF1 clarified), soluble (after IDA column) 
and elmion samples from the IDA column purification. The results demonstrated that a small 
percentage of BotB protein was soluble, that the soluble protein was not precipitated by PF1 
treatment and was quantitatively bound by the IDA column. Since a 1 liter fermentation 
harvest yielded a 67.5 gm cell pellet, this indicated that the yield of soluble affinity purified 
BotB protein from the IDA column was 14 mg/litcr. 

EXAMPLK 39 

Co-Expression Of Recombinant BotB Proteins 
And Folding C'hapcrones In Fermentation Cultures 

Fermentations were performed to determine if the simultaneous overexpression of 
folding chaperones (i.e.. the Gro operon) and the BotB protein resulted in enhanced solubility 
of the Bot B protein. This example involved fermentation of the pIlisBotBkan laclq 
T7lac/pACYCC3ro BL21(DE3). pHisBotB kan T71ac/pACYCGro B12HDE3) and pIlisBotBkan 
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laclq Til pACYCGro BL21(DE3) cell lines. Fermentation was carried out as described in 
Example 31: 34 ug/ml chloramphenicol was included in the feeder and fermentation cultures. 

a) Fermentation Of pHisBotBkan laclq T7lac/pACYCGro 
BL21(DE3) Cells 

Induction was with 4 gms IPTG at 1 hr 15 min post initiation of the glucose feed. 
The OD,,, (l was 38 at time of induction, then 50. 58.5. 62 and 68 at 1-4 hrs post induction. 
Viable colony counts decreased during induction (24. 0. 0. 2. 0 at 0-4 hr induction: dilution 3 
utilized 3 M l of dilution 2 cells). Of 24 colonies scored at the time of induction. 24 retained 
the BotB plasmid (kan resistant). 24 contained the chaperone plasmid (chloramphenicol 
resistant) and no colonies at induction grew on JPTG+Kan plates (no mutations detected). 

Total and soluble extracts were resolved on 12.5% SDS-PAGF gels and were either 
stained with Coomassie blue or subjected to Western blotting (his-tagged proteins were 
detected utilizing the NiNTA-alkaline phosphatase conjugate). This analysis revealed that the 
(iro chaperone* were induced to high levels, but very low level expression of soluble BotB 
protein was observed, increasing from 1 to 4.0 hrs post induction (no expression detected in 
uninduccd cells, induced protein detected only on Western blot). The dramatically lower 
expression of BotB protein in the presence of chaperone may be due to promoter occlusion 
</.*.. the stronger T7 promoter on the chaperone plasmid was preferentially utilized). 

■>) Fermentation Of pHisBotB kan T7lac/pACYCGro/BI2l(DE3) 
Cells 

Induction was with 4 gms IPTG at I hr post initiation of the glucose feed. The OD MW 
was 33.5 at time of induction, then 44. 51. 58.5 and 69 at 1-4 hrs post induction. Viable 
colony counts decreased after 2 hrs induction (43. 65. 74. 0 (70). 0 (70) at 0-4 hr induction: 
bracketed numbers represent microcolonies; dilution 3 utilized 3 ul of dilution 2 cells). Most 
colonies a. induction retained the BotB plasmid (kan rcsistan«)and the chaperone plasmid 
(chloramphenicol resistant) and no colonies at induction grew on IPTG+Kan plates (no 
mutations detected). 

Total and soluble extracts were resolved on a 12.5% SDS-PAGK gel and subjected to 
Western blotting: his-tagged proteins were detected utilizing the NiNTA-alkaline phosphatase 
conjugate. This analysis revealed that the Gro chaperone* were induced to high levels and 
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low level expression of soluble Bot B protein was observed, increasing from I to 4.0 hrs post 
induction (no expression detected in uninduced cells). 

A small scale IDA purification of BotB protein from a 250 ml PHI clarified 15% w/v 
extract (37.5 gm cell pellet) yielded approximately 12.5 mg protein, of which approximately 
50% was BotB protein and 50% was GroEL chaperone (assessed by Coomassie staining of a 
10% SDS-PAGE gel). The NiNTA alkaline phosphatase conjugate was utilized to detect his- 
tagged proteins on a Western blot containing total, soluble, soluble (PF.I clarified), soluble 
(after IDA column) and clution samples from the IDA column purification. The results 
demonstrated that all of the BotB protein produced by the pi lisBotB kan 
T7lac/pACYCGio/BI21(DE3) cells was soluble: the BotB protein was not precipitated by PEI 
treatment and was quantitatively bound by the IDA column. Since a I liter fermentation 
harvest yielded a 75 gm cell pellet, this indicated that the yield of soluble affinity purified hot 
13 protein from this fermentation was 12.5 mg/liter. These results also demonstrated that 
additional purification steps arc necessary to separate the chaperone proteins from the BotB 
protein. 



c) Fermentation Of pHisBotBkan laclq 
T7/pACYC(«ro/BL21(DE3) Cells 

Induction was with 4 gms IPTG at 2 hr post initiation of the glucose feed. The ()D M , 
was 46 at time of induction, then 56. 63. 69 and 71.5 at 1-4 hrs post induction. Viable colony 
counts decreased after induction (58. 3(5). 3. 0. 0 at 0-4 hr induction: bracketed numbers 
represent microcolonies: dilution 3 utilized 3 (.il of dilution 2 cells). All (53/53) colonies 
scored at the time of induction retained the BotB plasmid (kan resistant) and the chaperone 
plasmid (chloramphenicol resistant) and no colonies at induction grew on IPTG+kan plates 
(no mutations detected). 

Total and soluble extracts were resolved on a 10% SDS-PAGE gels and Western 
blotted and his-tagged proteins were detected utilizing the NiNT A-alkaline phosphatase 
conjugate. This analysis revealed that the Gro chaperones were induced to high levels 
(observed by ponceau S staining), and a much higher expression of soluble Bot B protein 
(compared to expression in the pHisBotB kan T7lac/pACYCGro fermentation) was observed 
at all timepoints. including uninduced cells (some increase in BotB protein levels were 
observed after induction). 
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A small scale IDA purification of BotB protein from a 100 ml PEI clarified 15% w/v 
extract (15 gm cell pellet) yielded approximately 40 mg protein, of which approximately 50% 
was BotB protein and 50% was GroEL chaperone. as assessed by Coomassie staining of a 
10% SDS-PAGE gel. The NiNTA alkaline phosphatase conjugate was utilized to detect his- 
tagged proteins on a Western blot containing total, soluble, soluble (PEI clarified), soluble 
(after IDA column) and elution samples from the IDA column purification. The results 
demonstrated that a significant percentage (i.e.. -10-20 %) of BotB protein was soluble, that 
the solubilized protein was not precipitated by PEI treatment and was quantitatively bound by 
the IDA column. Since a 10 liter fermentation yielded a 108 gm cell pellet, this indicated that 
the yield of soluble affinity purified BotB protein from this fermentation was 144 mg/liter. 

In a scale up experiment. 2 liters of a 20% w/v PEI clarified lysate of pHisBotB'kan 
laclq T7/pACYCGro/BL2l(DE3) cells were purified on a large scale IDA column. The 
purification was performed in duplicate. The total yield of BotB protein was 220 and 325 
mgs protein in the two experiments (assuming I mg/ml solution - 2.0 OD :iII /ml). This 
represents 0.7% or 1.0%. respectively, of the total soluble cellular protein (assuming a PEI 
lystate having a concentration of 8 mg protein/ml and that the eluted material comprises a 1:1 
mixture of BoiB and folding chaperone). The NiNTA alkaline phosphatase conjugate was 
utilized to detect his-tagged proteins on a Western blot containing total, soluble, soluble (PEI 
clarified), soluble (after IDA column) and elution samples from the IDA column purification. 
These results demonstrated that a significant percentage (i.e.. -10-20 %) of the BotB protein 
was soluble, that the solubilized protein was not precipitated by PEI treatment and was 
quantitatively bound by the IDA column. Since a I liter fermentation harvest yielded a 108 
gm cell pellet, this indicated that the yield of soluble affinity purified BotB protein from the 
large scale purification was 60 mg or 89 mg/litcr. These results also demonstrated that 
further purification would be necessary to remove the contaminating chaperone protein. 

The above results provide methodologies for the purification of soluble BotB protein 
from fermentation cultures, in a form contaminated predominantly with a single £. voii 
protein (the folding chaperone utilized to enhance solubility). In the next example, methods 
are provided for the removal of the contaminating chaperone protein. 



- 197 - 



WO 98/08540 



PCT/US97/15394 



EXAMPLE 40 

Removal Of Contaminating Folding Chaperone Protein 
From Purified Recombinant C. hoiulinum Type B Protein 

In this example size exclusion chromatography and ultrafiltration was used to purify 
recombinant BotB protein from the folding chapcroncs and imidazole in IDA-purified 
material. 

To enhance the solubility of the recombinant BotB protein during scale-up. the protein 
was co-expressed with folding chaperones (see Ex. 39). During the Ni-IDA purification step, 
the folding chaperones co-eluted with the BotB protein in 800 mM imidazole: therefore, a 
second purification step was required to isolate the BotB free of folding chaperones. Lane 3 
of Figure 35 contains proteins eluted from an IDA column to which a lysate of pHisBotB kan 
laclq T7/pACYCGro/BL21(DE3) cells had been applied: the proteins were resolved on a 
4-15% polyacrylamide pre-cast gradient gel (Bio-Rad. Hercules. CA) run under native 
conditions and then stained with Coomassic blue. In Figure 35. lanes 1 and 4 contain proteins 
present in peak 1 and peak 2 from a Sephacryl S-100 column run as described below: lane 2 
is blank. 

As seen in lane 3 of Figure 35. the IDA-purified sample consists primarily of the 
folding chaperones and the BotB protein. The fact that the chaperones and the Bot B antigen 
appear as two distinct bands under native conditions suggested they were not complexcd 
together and therefore, it should be possible to separate them, using either a gradient of 
imidazole concentrations or size exclusion methods. 

In order to determine whether a gradient of imidazole concentrations could be used to 
separate the chaperone from the BotB protein, a step gradient using imidazole at 200. 400. 
000. and 800 mM in 50 mM sodium phosphate. 0.5 M NaCI and 10 % glycerol, pi I 6.8 was 
applied to an IDA column (containing proteins bound from a lysate of pHisBotB kan laclq 
r7/pACYCGro/BL21(DL3) cells). By narrowing the range of imidazole concentrations, it 
was hoped that the BotB and chaperone proteins would differentially elule at different 
concentrations of imidazole. Eluted proteins were monitored by absorbance at 280 nm and 
collected cither manually or with a fraction collector (BioRad). Protein was found to elute at 
200 and 400 mM imidazole only. 

Figure 36 shows a Coomassic stained SDS-PAGE gel containing protein eluted during 
the imidazole step gradient. Lane I contains broad range MW markers (BioRad). Lane 2 
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contains BotB protein purified by IDA chromatography of an extract of pHisBotB/BL21(DE3) 
pLysS cells grown in shaker flask culture (i.e.. no co-expression of chaperoncs: Ex. 35). 
Lane 3 contains a 20% w/v PEI clarified lysate of pHisBolB kan laclq 
T7/pACYCOro/BL2HDE3) ceils (/.<.-.. the lysate prior to purification by IDA 
chromatography). Lanes 4 and 5 contain protein which cluted at 200 or 400 mM imidazole, 
respectively. Lane 6 is blank. Lanes 7 and 8 contain 1/5 the load present in lanes 4 and 5. 

As shown in Figure 36. both the chaperonc and the BotB protein eluied in 200 mM 
imidazole, and more chaperone elutes in 400 mM imidazole, however no concentration of 
imidazole tested permitted the elution of BotB protein alone. Consequently, no significant 
purification was achieved using imidazole at these concentrations. 

Because of the considerable difference in molecular weights between the folding 
chaperone. which is a multimer with a total molecular weight around 400 kD (as determined 
on a Shodcx KB 804 s.zing column by HPLC). and the recombinant BotB nrote.n (molecular 
ueigh, around 50 kD). size exclusion chromatography was next examined for the ability «o 
separate these proteins. 



a) Size Exclusion Chromatography 

A column containing Sephacryl S-100 fIR (S-100) (Pharmacia) was poured (2.5 cm x 
24 cm : 1 10 m | bed volume). The column was equilibrated in a buffer consisting of 
phosphate buffered saline (lOmM potassium phosphate. 150 mM NaCI. pH 7.2) and 10 % 
glycerol (Mallinkrodt). Typically. 5 ml of the IDA-puritied BotB protein was filtered through 
" 0.45 A < syringe filter and applied to the column, and the equilibration buffer was pumped 
through the column at a How rate of I ml/minute. Fluted proteins were monitored bv 
ataorbancc at 280 nm and collected either manually or with a fraction collector. Appropriate 
tubes were pooled, if necessary, and the protein was quantitatcd by absorbance at 280 nm 
and/or by BCA protein assay. The isolated peaks were then analyzed by native and/or 
SDS-PAGE to identify the protein and evaluate the purity. 

Because of its larger size, the folding chaperone eluted first, followed bv the 
recombinant BotB protein. A smaller third peak was observed which failed to stain when 
analyzed by SDS-PAGE and therefore was presumed to be imidazole. 

SDS-PAGE analysis (12.5% polyaerylamide. reduced samples) was used to evaluate 
the purity of the IDA-purilied recombinant BotB protein before and after S-100 purification. 
The results are shown in Figure 33. 
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In Figure 33. lane I contains broad range MW markers (BioRad). Lane 5 contains 
IDA-purifled BotB protein. Lane 6 contains IDA-purified BotB protein following S-I00 
purification. Lane 7 is blank (lanes 2-4 were discussed in Ex. 34 above). 

The results shown in Figure 33 show that the IDA-purified BotB is significantly 
contaminated with the folding chaperone (molecular weight about 60 kl) under reducing 
conditions: lane 6). Following S-100 purification, the amount of folding chaperone present in 
the BotB sample was reduced dramatically (lane 7). Visual inspection of the Coomassie 
stained SDS-PAGL gel revealed that after S-100 purification. > 90% of the total protein 
present was BotB. 

The IDA-purificd BotB and the S-100-purified BotB samples were analyzed by HPLC 
on a size exclusion column (Shodex KB 804); this analysis revealed that the BotB protein 
represented 64% of the total protein in the IDA-purified sample and that following S-100 
purification, the BotB protein represented >95% of the total protein in the sample. 

The IDA-purified BotB material was also applied to a ACA 44 (SpeetraPor. Houston. 
I'X) column. The ACA 44 resin is equivalent to the S-100 resin and chromatography using 
the ACA 44 resin was carried out exactly as described above for the S-100 resin. The ACA 
44 resin was found to separate the recombinant BotB protein from the folding chaperone. 
I he ACA 44-purilled BotB sample was analyzed for endotoxin using the LAL assay 
(Associates of Cape Cod) as describe in Hxample 24. Two aliqouts of the ACA 44-purified 
BotB preparation were analyzed and were found to contain either 58 to I 16 l-li/mg 
recombinant protein or 94 to 189 FU/mg recombinant protein. 

Ihese results demonstrate that size exclusion chromatography can be used to purify the 
recombinant BotB protein from the folding chaperone and imidazole in IDA-purified material. 

b) Ultrafiltration For The Separation Of Recombinant BotB 
Protein And Chupcroncs 

Ultrafiltration was examined as an alternative method for the separation recombinant 
BotB protein and folding chaperones in IDA-purified material. While in this example only 
mixtures of BotB and chaperones were separated by ultrafiltration, this technique is suitable 
for use with recombinant BotA and BotL proteins as well provided that the wash buffers used 
are altered as necessary to take into account different requirements for solubility. 

The recombinant BotB protein and folding chaperones were separated using a two-step 
sequential ultrafiltration method. The first membrane used had a nominal molecular weight 
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cutoff (MWCO) of approximately 100 kD: this membrane retains the larger folding chaperonc 
while allowing the smaller recombinant protein to pass through. The addition of several 
volumes of wash buffer may be required to efficiently wash the recombinant protein through 
the membrane. The second step utilized a membrane with a nominal MWCO of 
approximately 10 kD. During this step, the recombinant antigen was retained by the 
membrane and could be concentrated to the degree desired and the imidazole and excess wash 
buffer passed through the membrane. 

Twenty-seven milliliters of an IDA-purified BotI) preparation was ultrafiitercd through 
a 47 mm YM 100 (100 kD MWCO) membrane (Amicon) in a 50 ml stirred cel. (Amicon). 
The membrane was washed in dd IKO prior to use as recommended by the manufacturer. Six 
volumes of 10% glycerol in PBS were washed through ,o remove most of the recombinant 
BotB proicin and this wash was collected in a separate vessel. The resulting BotB 
protein-rich Citrate was then concentrated 12-fold using a YM 10 (10 kD MWCO) membrane 
(Amicon). to a final volume of 14 ml. The YM 100 and YM 10 concentrates were analvzcd 
along with the lysine starting material by native PAGE using a 4 -15% pre-eas. gradient gel 
(BioRad). The results are shown in Figure 37. 

In Figure 37. lane 1 conlains IDA-purified BotB derived from a shaker flask culture 
no co-expression of chaperoncs: lis. 35): lane 2 contains a 20% w/v PR! clarified Ivsatc 
of pHisBotB kan lack, T7/pACYCGro/BL21(DE3) cells: lane 3 shows the lysate of lane 3 
after IDA purification: lane 4 contains the YM 10 concentrate and lane 5 contains the YM 
100 concentrate. 

The results shown in Figure 37 demonstrate that .he recombinant BotB protein can be 
purified away from the folding chapcrone by ultrafiltration through a 100 kD MWCO 
membrane (lane 4). leaving the chaperone protein in the 100 kD concentrate (lane 5). 
Analysis of the sample in lane 5 also showed that very little of the BotB protein was retained 
by the 100 kD MWCO membrane after 6 volumes of wash buffer had been applied. 

The BotB samples following IDA chromatography and following ultrafiltration through 
•lie YM 100 membrane were anlyzed by HPLC on a size exclusion column (Shodex KB 804)- 
this analysis revealed that the BotB protein represented 64% of the total protein in the IDA- 
purilicd sample and that following ultrafiltration through the YM 100 membrane, the BotB 
protein represented >%% 0 f the total protein in the sample. 

The BotB protein purified by ultrafiltration through the YM 100 membrane was 
examined for endotoxin using the LAL assay (Associates of Cape Cod) as describe in 



WO 98/08540 



PCT/US97/15394 



Example 24. Two aliqouts of the YM 100-puriticd BotB preparation were analyzed and were 
found to contain either 18 to 36 EU/mg recombinant protein or 125 lo 250 EU/mg 
recombinant protein. 

The above results demonstrate that size exclusion chromatography and ultrafiltration 
can be used to purify recombinant botulinal toxin proteins away from folding chaperones, 

EXAMPLE 41 

Cloning And Expression Of The C Fragment 
Of The C. hotulinum Serotype F Toxin Gene 

['he C. hotulinum type E neurotoxin gene has been cloned and sequenced from several 
different strains fPoulet ei al. (1992) Biochem. Biophys. Res. Commun. 183:107 (strain 
Beluga): Whclan el al. (1992) Eur. J. Biochem. 204:657 (strain NCTC I 1219); Fujii ei ai. 
(1990) Microbiol. Immunol. 34:1041 (partial sequence of strains Mashike. Iwani and Otaru) 
and Fujii et al. (1993) J. Gen. Microbiol, 139:79 (strain Mashike)]. The nucleotide sequence 
of the type E toxin gene is available from the EMBL sequence data bank under accession 
numbers X62089 (strain Beluga) and X62683 (strain NCTC 11219). The nucleotide sequence 
of the coding region (strain Beluga) is listed in SEQ ID NO:49. The amino acid sequence of 
the C. hotulinum type E neurotoxin derived from strain Belgua is listed in SEQ ID NO:50. 
The nucleotide sequence of the coding region (strain NCTC 1 1219) is listed in SEQ ID 
NO:51. The amino acid sequence of the ('. hotulinum type E neurotoxin derived from strain 
NCTC 1 1219 is listed in SEQ ID N():52. 

The DNA sequence encoding the native (/. hotulinum serotype E C fragment gene 
derived from the Beluga strain can be expressed as a histidinc-tagged protein using the 
pETHisb vector: the resulting coding region is listed in SEQ ID NO:53 and the corresponding 
amino acid sequence is listed in SEQ ID NO:54. The DNA sequence encoding the C 
fragment of the native ('. hotulinum serotype E gene derived from the NCTC 1 1219 strain 
can be expressed as a hislidine-iagged fusion protein using the pETHisb vector: the resulting 
coding region is listed in SEQ ID NO:55 and the corresponding amino acid sequence is listed 
m SEQ ID NO:56. The C fragment region from any strain of C. hotulinum serotype E can be 
amplified and expressed using the approach illustrated below using the C fragment derived 
from ('. hotulinum type E 2231 strain (ATCC //1 7786). 
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The type E neurotoxin gene is synthesized as a single polypeptide chain which may be 
converted to a double-chain form (i.e.. a heavy chain and a light chain) by cleavage with 
trypsin: unlike the type A neurotoxin, the type E neurotoxin exists essentially only in the 
single-chain form. The 50 kD carboxy-terminal portion of the heavy chain is referred to as 
ihe C fragment or the H ( . domain. Expression of the C fragment of C. hoiulimm, type E toxin 
in heterologous hosts (e.,i>.. E. coii) has not been previously reported. 

The native C fragment of the C. botulinum serotype R toxin (BotE) gene was cloned 
and inserted into expression vectors to facilitate expression of the recombinant BotE protein in 
E. cnli. This example involved PCR amplification of the gene, cloning, and construction of 
expression vectors. 

The BotE serotype gene was isolated using PCR as described for the BolA serotype 
gene in Example 28. The C boiulmum type E strain was obtained from the American Type 
C ulture Collection (ATCC #17786: strain 2231). The following primer pair was used in the 
PCR amplification: 3 -CGCC ATGGCTCTTTC I TCTTAT ACAGATGAT-3" (5" primer, 
engineered ,Vtv>I site underlined) (SEQ ID NO:57) and 

>-«CmKn7TA7T7TKTYGCCATCCATG-y ( y primer, engineered //mdlll site 
underlined, native gene termination codon italicized) (SEQ ID NO:58). The PCR product was 
inserted into pCRscript as described in Example 28. The resulting pCRscript BotE clone was 
confirmed by restriction digestion, as well as. by obtaining the sequence of approximately 300 
bases locaied at the 5" end of the C fragment coding region using standard DNA sequencing 
methods. The resulting BotE sequence was identical to that of the published C. hmulinum 
type E toxin sequence fWhelan el al (1992). supra). 

The A/jdffilledV/y/ndlll fragment from a pCRscript BotE recombinant was cloned into 
pETHisb vector as described for BotA C fragment in Example 28. The resulting construct 
was termed pHisBolE. pHisBolE expresses the BotE gene under the control of the T7 lac 
promoter and the resulting protein contains an N-terminal lOXHis-tag affinity lag. 

The pIlisBotE expression construct was transformed into BL2KDE3) pLysS competent 
cells and 1 liter cultures were grown, induced and his-tagged proteins were purified utilizing a 
NiNTA resin (elutcd in low pH elution buffer) as described in Example 28. Total, soluble 
and purified proteins were resolved by SDS-PAGE and detected by Coomassic staining. The 
results are shown in Figure 38. 

In Figure 38. lane 1 contains broad range MW markers (BioRad): lane 2 contains a 
total protein extract: lane 3 contains a soluble protein extract: lane 4 contains proteins present 
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in the ilow through from the NiNTA column (this sample was not diluted prior to loading and 
therefore represents a load 5X that of the load applied for the total and soluble extracts in 
lanes 2 and 3): lane 5 contains proteins eluted from the NiNTA column: lane 6 contains 
protein eluted from a NiNTA column which had been stored at -20°C for 1 year. 

The pllisBotF. protein was expressed at moderate levels (7 mg/liter) as a totally soluble 
protein. The purified protein migrated as a single band of the predicted MW. 

Western blot hybridization utilizing a chicken anti-C. hotulinum serotype E toxoid 
primary antibody (generated by immunization of hens as described in Example 3 using C. 
hotulinum serotype E toxoid ) was also performed on the total, soluble and purified BotE 
proteins. Samples of BotA and BotB C fragments were also included on the gels to facilitate 
MW and inimunogenicity comparisons. Strong immunoreactivity was detected using the 
anti-C. hotulinum type E toxoid antibody only with the BotE protein. 

These results demonstrate that the native BotE gene sequences can be expressed as a 
•soluble liis-tagged protein in E. coli and purified by mctal-ehelalion affinity chromatography. 

EXAMPLE 42 

Generation Of Neutralizing Antibodies Using The Recombinant pHisBotE Protein 

The ability of the purified pHisBotE protein to generate neutralizing antibodies was 
20 examined. Nine BAl.Bc mice were immunized with BotE protein (purified as described in 
Ex. 41) using (ierbu GMDP adjuvant (CC Biotech). The low pH eiution was mixed with 
Gerbu adjuvant and used to immunize mice. Each mouse received a subcutaneous injection 
of 100 ul antigen/adjuvant mix (35 ug antigen + I ug adjuvant) on day 0. Mice were 
subcutnneously boosted as above on day 14 and bled on day 28. Mice were subsequently 
25 boosted and bled on day 70. 

Anti-C. hotulinum serotype V. toxoid titers were determined in day 28 serum from 
individual mice from each group using the EIJSA protocol outlined in Example 29 with the 
exception that the plates were coated with C. hotulinum serotype E toxoid, and the primary 
antibody was a chicken anti-C. hotulinum serotype E toxoid. Seroconversion (relative to • 
30 control mice immunized with the p6xllisBotA antigen (Ex. 29)1 was observed with all 9 mice 
immunized with the purified pHisBotE protein. 

The ability of the anti-BotE antibodies to neutralize native ( ' hotulinum type E toxin 
was tested in a mouse-C. hotulinum neutralization model using pooled mouse serum (see Ex. 
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23b). The LD,„ of purified C. botulmum type E toxin complex (Dr. Eric Johnson. University 
of Wisconsin. Madison) was determined by a intraperitoneal (IP) method [Schantz and Kautler 
(1978). supra| using 18-22 g female ICR mice. The amount of neutralizing antibodies present 
in the serum of the immunized mice was determined using serum antibody titrations. The 
various serum dilutions (0.01 ml) were mixed with 5 LD„, units of C ho,ulinu m type E toxin 
and the mixtures were injected IP into mice. The neutralizations were performed in duplicate. 
The mice were then observed for signs of botulism for 4 days. Undiluted serum from day 28 
did not protect, while undiluted. 1/10 diluted and 1/100 diluted day 70 serum protected (1005 
of animals) while 1/1000 diluted day 70 serum did not. This corresponds ,o a neutralization 
titer of 50-500 IU/ml. 

These results demonstrate that seroconversion occurred and neutralizing aniibodies 
were induced when the recombinant Both protein was utilized as the immunogen. 

EXAMPLE 43 

Construction Of Vectors To Facilitate Expression 
Of His- Tagged BotE Protein In Fermentation Cultures 

A number of expression vectors were constructed to facilitate the expression of 
recombinant BotE protein in large scale fermentation culture. These constructs varied as to 
the strength of the promoter utilized (T7 or T7lac) and the presence of repressor elements 
'lack,, on the plasmid. The resulting constructs varied in the level of expression achieved and 
m plasmid stability which facilitated the selection of a optimal expression system lor 
fermentation scaleup. This example involved a) construction of BotE expression vectors and 
b) determination of expression levels in small scale cultures. 

a) Construction Of BotE Expression Vectors 

The BotE expression vectors created for fermentation culture were engineered to 
utilize the kanamycin rather than the ampicillin resistance gene, and contained either the T7 
T7lac promoter, with or without the laclq gene for the reasons outlined in Example 30. 

In all cases, the protein expressed by the various expression vectors is the pHisBotE 
Protein described in Example 41. with the only differences between Cones being the alterat 
ol various regulatory elements. Using the designations outlined below, the pHisBotE clone 
(Ex. 41) is equivalent to pHisBotE amp T7lac. 



or 



ion 
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i) Construction Of pHisBotE kan laclq T7Iac 

pHisBotE kan laclq T7lac was constructed by inserting the Xhal/Hindlll fragment of 
pHisUotE which contains the BotE gene sequences into ATwI/tfmdlH-clcaved pET24a vector. 
Proper construction was confirmed by restriction digestion. 

ii) Construction Of pHisBotE kan T7 

pHisBotE kan T7 was constructed by ligating the BotE-eontaining XhaMSupX fragment 
of pHisBotE kan lacJqT7lac to the T7 promoter-containing XhaMSup] fragment of pET23a. 
Proper construction was confirmed by restriction digestion. 

iii) Construction Of pHisBotK kan laclqT7 

pHisBotE kan ladqT7 was constructed by inserting the D^l\MHmd\\\ fragment from 
pHisBotE kan 17 which contains the Both gene sequences into fl,v/II//-//wdUI-cleaved pET24 
vector. l > roper construction was confirmed by restriction digestion. 

h) Determination Of BotE Expression Levels In Small Scale 
Cultures 

I he three BotE kan expression vectors described above were transformed into 
BI2KDE3) competent cells and 50 ml (2XYT + 40 ng/ml kan) cultures were grown and 
induced with ITPG as described in Example 28. Total and soluble protein extracts from 
before and after induction made as described in Example 28, The total and soluble extracts 
were resolved on a 12.5% SDS-PAGE gel. and his-tagged proteins were detected on a 
Western blot utilizing the NiNTA-alkalinc phosphatase conjugate as described in Example 
31(c)(iii>. The results showed that all three BotE cell lines expressed his-tagged proteins of 
the predicted MW for the BotE protein upon induction. The results also demonstrated that 
the two constructs that contained the T7 promoter expressed the Boll- protein before 
induction, while the T7lac promoter construct did not. Upon induction, the T7 promoter- 
containing constructs induced to higher levels than the T7lac-containing construct, with the 
pllisBotli kan lacIqT7/B121(DE3) cells accumulating the maximal levels of BotE protein. 
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EXAMPLE 44 

Expression And Purification Of pHisBotE From Fermentation Cultures 

Based on the small scale inductions performed in Example 43. the pHisBotE kan laclq 
T7/BI2KDF.3) strain was selected for fermentation scaleup. This example involved the 
fermentation and purification of recombinant BotE C fragment protein. 

A fermentation with the pHisBotE kan laclq T7/BI2I(DF.3) strain was performed as 
described in Example 31. The fermentation culture was induced 2 hrs post start of the 
glucose feed with 4 gm IPTG (final concentration = 1.6 mM). The OD mt was 42 at time of 
induction, then 46.5. 48. 53 and 54 at 1-4 hrs post induction. Viable colonv counts decreased 
Imm 0-4 hr induction 1131. 4 (28). 7 (3), 7. 8; dilution 3 utilized 6 ul of dilution 2 cells: 
bracketed colonies are microcolonies). All (32/32) colonies scored at the time of induction 
retained the BotE plasmid (kan resistant) and no colonies at induction grew on IPTG+Kan 
Plates <no mutations detected). These results were indicative of strong promoter induction, 
since colony viability reduced after induction, and the culture stopped growing during 
fermentation (stopped at 54 OD„ (1() /ml). 

Total and soluble extracts were resolved on a 12.5% SDS-PAGE gel and total protein 
was detected by staining with Coomassie blue. The results are shown in Figure 39. 

In Figure 39. lane I contains total protein from a pHisBotA kan T7 lac/BI2l(DE3) 
pLysS fermentation (Ex. 24). Lanes 2-9 contain extracts prepared from the above pHisBotE 
kan luck, T7/BI2KDE3) fermentation: lanes 2- 4 contain total protein extracts prepared at 0. I 
and 2 hours post-induction. respectively. Lane 5 contains a soluble protein extract prepared at 
2 hours post-induction. Lanes 6 and 7 contain total and soluble extracts prepared at 3 hours 
post-induction, respectively. Lanes 8 and 9 contain total and soluble extracts prepared at 4 
hours post-induction, respectively. Lane 10 contains broad range MW markers (BioRad). 

The results shown in Figure 39 demonstrate that moderate level induction of totally 
soluble Bot K protein was observed, increasing from I to 4 hrs post induction (no expression 
was detected in uninduccd cells). From a 2 liter fermentation harvest a 155 gm (wet wt) cell 
pellet was obtained and used to make a PEI-clarified lysate (I liter in CRB. pll 6.8). The 
lysate was applied to a large scale IDA column and 200 my of BotE protein, which was 
found to be greater than 95% pure (as judged by visual inspection of a Coomassie stained 
SDS-PAGE gel), was recovered. This represents 2.5% of the total soluble cellular protein 
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(assuming a PEI lysatc having a concentration of 8 mg protein/ml) and corresponds to a yield 
of !()() mg BotE protein/liter of fermentation culture. 

The above results demonstrate that high levels of the recombinant BotE protein can be 
expressed and purified from fermentation cultures. 

EXAMPLE 45 

Removal Of Imidazole From Purified Recombinant BotE Protein Preparations 

The expression of recombinant BotE protein, unlike the BotA and BotB proteins, did 
not require the presence of folding chaperones to maintain solubility during scale-up. A size 
exclusion chromatography step was included however to remove the imidazole from the 
sample and exchange the IDA elution buffer for one consistent with the BotA antigen. 

A Scphacryl S-100 I IR (S-100: Pharmacia) column was poured (2.5 cm x 24 cm: bed 
volume 110 ml). Under these conditions, the BotE protein should be retained by the beads 
to a lesser degree than the smaller imidazole, therefore the BotE protein should elute from the 
column before the imidazole. The column was equilibrated in a buffer consisting of 50 mM 
sodium phosphate. 0.5 M NaCI. and 10% glycerol (all reagents from Mallinkrodt). five 
milliliters of the IDA-puritied BotE protein (Ex. 44) was liltered through a 0.45 t i syringe 
tiller and applied to the S-100 column, and equilibration buffer was pumped through the 
column at a How rate of I ml/minute. Eluled proteins were monitored by absorbance at 280 
nm. and collected either manually or with a fraction collector. Appropriate tubes were pooled 
if necessary, and the protein was quantitated by absorbance at 280 nm and/or IK A protein 
assay. The isolated peaks were then analyzed by native and/or SOS-PAGE u> identify the 
protein(s) and evaluate the purity. 

Figure 40 provides a representative chromatogram showing the purification of 
IDA-piirilicd BotE on the S-100 column. Even though folding chaperones were not 
over-expressed with this antigen, a small amount of protein eluted at a time consistent with 
the folding chaperones expressed with BotA and BotB proteins (Gro) (see the fust peak). The 
second peak in the chromalogram contained the Boll- protein, and the third peak was 
presumably imidazole. This presumed imidazole peak was isolated in comparable levels in 
IDA-purilled BotA and BotB protein preparations as well. 
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These results demonstrate that size exclusion chromatography can be used to remove 
imidazole and traces of contaminating high molecular weight proteins from IDA-puriflcd 

BotF protein preparations. 

The S-100-puritled BoiE protein was tested for endotoxin contamination usina the 
LAI. assay as described in Example 24. This preparation was found to contam 64 to 128 
lilJ/mg recombinant protein and is therefore substantially free of endotoxin. 

The S-100 purilled Bo.E was mixed with purified preparations of BotA and BotB 
prolans and used to .mmunize mice: 5 ug of each Bot protein was used per immunization 
and alum was included as an adjuvant. After two immunizations with this triva.cn, vaccine 
the .mmunized mice were ehallanged with C ho,u,inu m toxin. The immunized mice 
contained neutralizing antibodies sufficient to neutralize between 100.000 to 1.000.000 I D, 
ol either toxin A or toxin B and between 1.000 to 10.000 I.D„, of toxin li. The titer of 
neutralizing anybodies directed against toxin E would be expected ,o increase following 
subsequent boosts with the vuecine. These results demonstrate that a triva.en, vaccine 
conta.nmg recombinant BotA. BotB and BotE proteins provokes neutralizing antibodies. 

EXAMPLE 46 

Expression Of The C Fragment Of The ( '. hoiulinmn 
Serotype C Toxin Gene And Generation Of Neutralizing Antibodies 

The C . /„„„/„„„„ type CI neurotoxin gene has been cloned and sequenced fKimura e, 
<"■<"*(» Bioehen, Biophys. Res. Comm. .71:1304]. The nucleotide sequence of the toxin 
Bene derived from the C hmulimm type C strain C-Stockho.m is available Iron, the 
l:MBI./GenBank sequence data banks under the accession number IW02I0: the nucleotide 
•sequence of the coding region is listed in SEQ ID NO:59. The amino acid sequence of the C 
W/,„«„ type CI neurotoxin derived from this strain is listed in SEQ II) NO:60. 

The DNA sequence encoding the native C. hatulinum serotvpe C. C fragment gene 
denved from the C-Stockho.m strain can be expressed using the pETHisb vector: the resulting 
cod.ng region is listed in SEQ ID NO:61 and the corresponding amino acid sequence is listed 
«n SEQ ID NO:62. The C fragment reg.on from any strain of C. /,„,„/,,„„„ . scro tvpe C can be 
ampl,l,ed and expressed using the approach illustrated below using the C fraument derived 
trom C. /„„,„;,„„„ type C C-Stockholm strain. Expression of the C fragment of C W/*,„ 
<vpe CI toxin in heterologous hosts (,.,.. L co li) has not been previously reported 
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The C fragment of the C hatulinum serotype CI (BotCI) toxin gene is cloned using 
the protocols and conditions described in Example 28 tor the isolation of the native BotA 
gene. A number of C. hatulinum serotype C strains (expressing either or both CI and C2 
toxin) arc available from the ATCC [e.g.. 2220 (ATCC 17782). 2239 (ATCC 17783). 2223 
5 (ATCC 17784: a type C-fJ strain; C-(5 strains produce C2 toxin). 662 (ATCC 17849: a type 
C-u strain: C-ra strains produce mainly CI toxin and a small amount of C2 toxin). 2021 
(ATCC 17850: a type C-a strain) and VP] 3803 (ATCC 25766)). Alternatively, other type C 
strains may be employed for the isolation of sequences encoding the C fragment of C. 
hatulinum serotype C toxin. 

10 The following primer pair is used to amplify the BotC gene: 5 -CG CCATGG C 

TTTATTAAAAGATATAATTAATG-3" [5' primer, engineered Ncal site underlined (SRQ ID 
NO:63)] and 5^GCMQ<nZ7T/l"rTCAClTACA(iGTAC AAAACC-3" |3* primer, 
engineered /Ymdlil site underlined, native gene termination codon italicized (SEQ ID 
NO:64)|. following PCR amplification, the PCR product is inserted into the pCRscripl vector 

l> and then the 1.5 kb fragment is cloned into pMTHisb vector as described for BotA C fragment 
gene in Example 28. The resulting construct is termed pHisBotC. Proper construction is 
confirmed by DNA sequencing of the BotC sequences contained within pilisBolC. 

pHisBotC expresses the BotC gene sequences under the transcriptional control of the 
T7 lac promoter and the resulting protein contains an N-lerminal l()XHis-tag affinity tag. The 

20 pHisBotC expression construct is transformed into BL2KDE3) pLysS competent cells and I 
liter cultures are grown, induced and his-tagged proteins are purified utilizing a NiNTA resin 
(eluted in 250 niM imidazole. 20% glycerol) as described in Example 28. Total, soluble and 
purified proteins arc resolved by SDS-PAGE and detected by Coomassie staining and Western 
blot hybridization utilizing a Ni-NTA-alkaline phosphatase conjugate (Qiagen) which 

25 recognizes his-tagged proteins as described in Example 31(c)(iii). This analysis permits the 

determination of expression levels of the pHisBotC protein {i.e.. number of mg/iiter expressed 
as a soluble protein). The purified BotC protein will migrate as a single band of the predicted 
MW (i.e.. -50kD). 

The level of expression of the pHisBotC protein may be modified (increased) by 
30 substitution of the T7 promoter for the T7iac promoter, or by inclusion of the laclq gene on 
the expression plasm id. and plasmid expressed in BL2KDE3) cell lines in fermentation 
cultures as described in Example 30. If only very low levels (i.e.. less than 0.5%) of soluble 
pHisBotC protein are expressed using the above expression systems, the pHisBotC construct 



- 210 - 



WO 98/08540 



PC17US97/15394 



may be co-expressed with pACYCGro construct as described in Example 32. In this case, the 
recombinant BotC protein may co-purify with the folding chaperoncs. The contaminating 
chaperones may be removed as described in Example 34. Preparations of purified pHisBotC 
protein arc tested for endotoxin contamination using the LAL assay as described in Fixample 



24. 



The purifed pHisBotC protein is used to generate neutralizing antibodies. BALBc 
mice are immunized with the BotC protein using Gerbu GMDP adjuvant (CC Biotech) as 
described in Example 36. The ability of the anti-BotC antibodies to neutralize native C. 
homlinum type C toxin is demonstrated using the mouse-C. hoiulinum neutralization model 
10 described in Example 36. 



20 



30 



EXAMPLE 47 

Expression Of The C fragment Of The C. hoiulinum 
Serotype D Toxin Gene And Generation Of Neutralizing Antibodies 

The C hoiulinum type D neurotoxin gene has been cloned and sequenced [Sunagawa 
et al. (1992) J. Vet. Med. Sci. 54:905 and Binz at al. (1990) Nucleic Acids Res. 18:5556]. 
The nucleotide sequence of the loxin gene derived from the CB16 strain is available from the 
EMBL/GenBank sequence data banks under the accession number S49407: the nucleotide 
sequence of the coding region is listed in SEQ ID NO:65. The amino acid sequence of the C. 
homlinum type D neurotoxin derived from the CBI6 strain is listed in SEQ ID NO:66. 

The DNA sequence encoding the native C. hoiulinum serotype D C fragment gene 
derived from a BotD expressing strain can be expressed using the pETMisb vector: the 
resulting coding region is listed in SEQ ID NO:67 and the corresponding amino acid sequence 
is listed in SEQ ID NO:68. The C fragment region from any strain of C. hoiulinum serotype 
D can be amplified and expressed using the approach illustrated below using the C fragment 
derived from C hoiulinum type D CBI6 strain. Expression of the C fragment of C 
hoiulinum type D toxin in heterologous hosts (e.g.. E. coli) has not been previously reported. 

The C: fragment of the C. hotulinum serotype D (BotD) toxin gene is cloned using the 
protocols and conditions described in Example 28 for the isolation of the native BolA gene. 
A number of C. hoiulinum type D strains are available from the ATCC |c\tf., ATCC 9633. 
2023 (ATCC 17851). and VPI 5995 (ATCC 27517)]. 
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The following primer pair is used to amplify the BotD gene: 5'-CGCCATGGC 
TTTATTAAAAGATATAATTA ATG-3 [5" primer, engineered Nva\ site underlined (SEQ ID 
NO:63)| and 5^GCMG£rT7T.4CTCTACCCATCCrGGATCCCT-.V \y primer, engineered 
//mdlll site underlined, native gene termination codon italicized (SEQ ID NO:69)]. 
Following PCR amplification, the PGR product is inserted into the pCRscript vector and then 
the 1.5 kb fragment is cloned into pETHisb vector as described for BotA C fragment gene in 
Example 28. The resulting construct is termed pHisBotD. 

pHisBotD expresses the BotD gene sequences under the transcriptional control of the 
T7 lac promoter and the resulting protein contains an N-terminal lOXHis-tag affinity tag. The 
pHisBotD expression construct is transformed into BL2I(DE3) pLysS competent cells and I 
liter cultures arc grown, induced and his-tagged proteins are purified utilizing a NiNTA resin 
as described in Example 28. Total, soluble and purified proteins are resolved by SDS-PAGE 
and detected by Coomassie staining and Western blot hybridization utilizing a Ni-NTA- 
alkaline phosphatase conjugate (Qiagen) which recognizes his-tagged proteins as described in 
Example 3 1 (c)(iii). This analysis permits the determination of expression levels of the 
pHisBotD protein [i.e.. number of mg/liter expressed as a soluble protein). The purified BotD 
protein will migrate as a single band of the predicted MW (i.e.. - 50kD). 

The level of expression of the pHisBotD protein may be modified (increased) by 
substitution of the T7 promoter for the T7lac promoter, or by inclusion of the laclq gene on 
the expression plasmid. and plasmid expressed in BI.21(DE3) cell lines in fermentation 
cultures as described in Example 30. If only very low levels less than about 0.5%) of 
soluble pHisBotD protein arc expressed using the above expression systems, the pHisBotD 
construct may be co-expressed with pACYCGro construct as described in Example 32. In 
this case, the recombinant BotD protein may co-purify with the folding chaperones. The 
contaminating chaperones may be removed as described in Example 34. Preparations of 
purified pHisBotD protein are tested for endotoxin contamination using the LAL assay as 
described in Example 24. 

The purifed pHisBotD protein is used to generate neutralizing antibodies. BALBc 
mice are immunized with the BotD protein using Gerbu GMDP adjuvant «X Biotech) as 
described in Example 36. The ability of the anti-BotD antibodies to neutralize native C 
hmulmum type D toxin is demonstrated using the mouse-C hotulinum neutralization model 
described in Example 36. 
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EXAMPLE 48 

Expression Of The C Fragment Of The C. botulinum 
Serotype F Toxin Gene And Generation Of Neutralizing Antibodies 

The C botulinum type F neurotoxin gene has been cloned and sequenced [East el al. 
( 1 W2> FEMS Microbiol. Lett. 96:225], The nucleotide sequence of the toxin gene derived 
from the 2021- strain (ATCC 23387) is available from the EMBL/GcnBank sequence data 
banks under the accession number M92906: the nucleotide sequence of the codinu region is 
lisied in SF.Q ID N():70. The amino acid sequence of the C. botulinum type F neurotoxin 
derived from the 202F strain is listed in SEQ ID NO:71. 

The DNA sequence encoding the native C. botulinum serotype F C fragment gene 
derived from the 202F strain can be expressed using the pETHisb vector: the resul.inu coding 
region is listed in SEQ ID NO:72 and the corresponding amino acid sequence is listed in SEQ 
ID NO: 73. The C fragment region from any strain of C. botulinum serotype F can be 
amplified and expressed using the approach illustrated below using the C fragment derived 
from C /„„„/,„,„„ , vpe p 2021" strain. Expression of the C fragment of C /„„„//„,„„ type F 
toxin in heterologous hosts {c. K .. E. co/i) has not been previously reported. 

The C fragment of the C. botulinum serotype F (BotF) toxin gene is cloned using the 
protocols and conditions described in Example 28 for the isolation of the native BotA gene. 
The C : botulinum type F 202F strain is obtained from the American Type Culture Collection 
(ATCC" 23387). Alternatively, sequences encoding the BotF toxin may be isolated from any 
Boil- expressing strain \ V . K .. VPI 4404 (ATCC 25764). VPI 2382 (ATCC 27321 ) and 
Langeland (ATCC 35415)). 

The following primer pair is used to amplify the BotF gene: .V-CG CC'ATGG C 
TATTCTAATTATATATTTTAATAG-3- \5' primer, engineered .\V„I site underlined (SFQ 
ID NO:74)J and 5--GCAAGCJTrC^TTCTTTCCATCCATTCTC-3- [3' primer, engineered 
MindlU site underlined, native gene termination codon italicized (SFQ ID NO:75». 
Following PGR amplification, the PCR product is inserted into the pCRscrip, vector and then 
»hc 1.5 kb fragment is cloned into pETHisb vector as described for BotA C fragment gene in 
Example 28. The resulting construct is termed pHisBotF. 

PHisBotF expresses the BotF gene sequences under the transcriptional control of the 
T7 lac promoter and the resulting protein contains an N-terminal lOXT Iis-tag affmitv tau. The 
PHisBotF expression construct is transformed into BL2l(DK3) pLysS competent cells and I 
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liter cultures are grown, induced and his-tagged proteins are purified utilizing a NiNTA resin 
as described in Example 28. Total, soluble and purified proteins are resolved by SDS-PAGE 
and detected by Coomassic staining and Western blot hybridization utilizing a Ni-NTA- 
alkalinc phosphatase conjugate (Qiagen) which recognizes his-tagged proteins as described in 
Example 31(c)(iii). This analysis permits the determination of expression levels of the 
pHisBotF protein {i.e.. number of mg/liter expressed as a soluble protein). The purified BotF 
protein will migrate as a single band of the predicted MW (i.e.. - 50kD). 

The level of expression of the pHisBotF protein may be modified (increased) by 
substitution of the T7 promoter for the T71ac promoter, or by inclusion of the laclq gene on 
the expression plasmid. and plasmid expressed in BL2I(DE3) cell lines in fermentation 
cultures as described in Example 30. If only very low levels (i.e.. less than about 0.5%) of 
soluble pHisBotF protein are expressed using the above expression systems, the pHisBotF 
construct may he co-expressed with pACYCGro construct as described in Example 32. In 
this case, the recombinant BotF protein may co-purify with the folding chapcroncs. The 
15 contaminating chaperoncs may be removed as described in Example 34. Preparations of 
purified pHisBotF protein arc tested for endotoxin contamination using the LAL assay as 
described in Example 24. 

The purifed pHisBotF protein is used to generate neutralizing antibodies. BALBc 
mice are immunized with the BotF protein using Gerbu GMDP adjuvant (CC Biotech) as 
20 described in Example 36. The ability of the anti-BolF antibodies to neutralize native C \ 

hotulimtm type F toxin is demonstrated using the mouse-C. hotulimtm neutralization model 
described in Example 36. 



EXAMPLE 49 

Expression Of The C Fragment Of The (*. hotulimtm 
Serotype G Toxin Gene And Generation Of Neutralizing Antibodies 

I he ( \ hotulinum type G neurotoxin gene has been cloned and sequenced (Campbell et 
aL (1993) Bioehimica el Biophysica Acta 1216:487 and Binz et ul. (1990) Nucleic Acids Res. 
18:5556], The nucleotide sequence of the toxin gene derived from the 1 13/30 strain (NCFB 
3012) is available from the EMBL/GenBank sequence data banks under the accession number 
X74162: the nucleotide sequence of the coding region is listed in SEQ ID NO:76. The amino 
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acid sequence of the C. botulimim type G neurotoxin derived from this strain is listed in SEQ 
ID NO:77. 

The DNA sequence encoding the native C. botulimim serotype G C fragment gene 
derived from the 1 13/30 strain can be expressed using the pETHisb vector: the resulting 
coding region is listed in SEQ ID NO:78 and the corresponding amino acid sequence is listed 
in SEQ ID NO:79. The C fragment region from any strain of C. hoiulinwn serotype G can 
be amplified and expressed using the approach illustrated below using the C fragment derived 
from C. tuuuhnum type Ci 113/30 strain. Expression of the C fragment of C. bomlimm type 
G toxin in heterologous hosts ((?.#.. E. coli) has not been previously reported. 

The C fragment of the C. bomlinum serotype G (BotG) toxin gene is cloned using the 
protocols and conditions described in Example 28 for the isolation of the native BotA gene. 
The C. hondnmm type G 1 13/30 strain is obtained from the NCFB. The following primer 
pair is used to amplify the BotG gene: 5 '-CG CC ATGG CTG AC ACAATTTTAATACA 
AG 1-3" [y primer, engineered Nco\ site underlined (SEQ ID NO:80)| and 
5-<iCCJXIJ^rr,frTCTGTCCATCCTTCATCCAC-3- |3" primer, engineered A7wl site 
underlined, native gene termination codon italicized (SEQ ID NO:8l)J. following PGR 
amplification, the PGR product is inserted into the pCRscript vector and then the 1.5 kb 
fragment is cloned into pETHisb vector as described for BotA C fragment gene in Example 
28 with the exception that the sequences encoding BotG are excised from the pCRscript 
vector by digestion with Nco\ and ATwl and the Nco\ site is blunted (the BotG sequences 
contain an internal ///mill I site). This AV«l(fillcd)/A'fojI fragment is then ligated to the 
pETHisb vector which has been digested with Nhd and Sail and the Nhel site is blunted. 
The resulting construct is termed pHisBotG. 

pHisBotG expresses the BotG gene sequences under the transcriptional control of the 
T7 lac promoter and the resulting protein contains an N-terminal lOXHis-tag affinity tag. The 
pHisBotG expression construct is transformed into BL2KDE3) pLysS competent cells and I 
liter cultures are grown, induced and his-tagged proteins are purified utilizing a NiNTA resin 
as described in Example 28. Total, soluble and purified proteins are resolved by SDS-PAGE 
and detected by Coomassie staining and Western blot hybridization utilizing a N'i-NTA- 
alkaline phosphatase conjugate (Qiagen) which recognizes his-tagged proteins as described in 
Example 3 l(c)(iii). This analysis permits the determination of expression levels of the 
pHisBotG protein (i.e.. number of mg/litcr expressed as a soluble protein). The purified BotG 
protein will migrate as a single band of the predicted MW (i.e.. ~50kD). 
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The level of expression of the pHisBotG protein may be modified (increased) by 
substitution of the T7 promoter lor the T71ac promoter, or by inclusion of the laclq gene on 
the expression plasmid. and plasmid expressed in BL21(DE3) cell lines in fermentation 
cultures as described in Example 30. If only very low levels (i.e.. less than about 0.5%) of 
soluble pHisBotG protein arc expressed using the above expression systems, the pHisBotG 
construct may be co-expressed with pACYCGro construct as described in Example 32. In 
this case, the recombinant BotG protein may co-purify with the folding chapcrones. The 
contaminating chaperoncs may be removed as described in Example 34. Preparations of 
purified pHisBotG protein are tested for endotoxin contamination using the LAL assay as 
described in Example 24. 

The purifed pHisBotG protein is used to generate neutralizing antibodies. BALBc 
mice are immunized with the BotG protein using Gerbu GMDP adjuvant (CC Biotech) as 
described in Example 36. The ability of the anti-BolG antibodies to neutralize native C. 
bomlinum type Ci toxin is demonstrated using the mouse-C. hatulinum neutralization model 
described in Example 36. 

EXAMPLE 50 

Expression Of Recombinant Bolulinal Toxin Proteins In F.ucaryotic I lost Cells 

Recombinant botulinal C fragment proteins may be expressed in eucaryotic host cells, 
such as yeast and insect cells. 



a) Expression In Yeast 

Botulinal C fragments derived from serotypes A. B. C. D. E. F and G may be 
expressed in yeast cells using a variety of commercially available vectors, for example, the 
PP1C3K and pPIC9K expression vectors (invitrogen) may be employed for expression in the 
methylotrophic yeast. Pichia panaris. When the pPIC3K vector is employed, expression of 
the botulinal C fragment protein will be intracellular. When the pl»IC3K vector is employed, 
the botulinal C fragment protein will be secreted (the alpha factor secretion signal is provided 
on the pPIC9K vector). 

DNA sequences encoding the desired C fragment is inserted into these vectors using 
techniques known to the art. Briefly, the desired botulinal expression cassette (including 
sequences encoding the his-tag: described in the preceding examples) is amplified using the 
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PCR in conjunction with primers that incorporate unique restriction sites at the termini of the 
amplified fragment. Suitable restriction enzyme sites include SnaBl EcoRl. AvrU and Noll 
When the botuiinal C fragment is to be expressed using the pPIC3K vector, the initiator 
methionine (ATG) is provided by the desired Bot gene sequence and a Kozak consensus 
sequence is engineered upstream of the ATG (<?.#., ACCATGG). 

The amplified restriction fragment containing the botuiinal C fragment gene is then 
cloned into the desired expression vector. Recombinant clones are integrated into the Pichia 
pastohs genome and recombinant protein expression is induced using methanol following the 
manufacturer's instructions (Inviirogen Pichia expression kit manual). 

T. hatulinum genes are A/T rich and contain multiple sequences that arc similar to 
yeast transcriptional termination signals (e.g.. TTTTTATA). If premature transcription 
termination is observed when the botuiinal C fragment genes are expressed in yeast, the 
transcription termination signals present in the C fragment genes can be removed by cither 
site directed mutagenesis (utilizing the pALTER system; Promega) or by construction of 
synthetic genes utilizing overlapping synthetic primers. 

The botuiinal C fragment genes may be expressed in other yeast cells using other 
commercially available vectors [e.g.. using the pYES2 vector (Inviirogen) and S. cerevisiae 
cells (Invitrogen)], 

b) Expression In Insect Cells 

Botuiinal C fragments derived from serotypes A. B. C. D. E. F and G may be 
expressed in insect cells using a variety of commercially available vectors. For example, the 
pBlueBac4 transfer vector (Invitrogen) may be employed for expression in Spodoptera 
frugipevtia (Sf)) insect cells (baculovirus expression system) (equivalent baculovirus vectors 
and host cells are avaialble from other vendors, e.g., Pharmingen. San Diego. CA). Botuiinal 
C fragments contained on Ncol/flindlll fragments contained within the pHisBotA-G 
expression constructs (described in the preceding examples) are cloned into the pBlueBac4 
vector (digested with Ncol and HindlU): the Ncol site present on the C fragment constructs 
overlaps with the start codon of the fusion proteins. In the case of botuiinal C fragment 
clones that contain internal HindlU sites (e.g., using the BotG sequences described in Ex. 49). 
the C fragment gene is contained within a Ncol/Xhol fragment on the pI HsBot construct. 
This Ncol/Xhol fragment is excised from pHisBot and inserted into pBlueBac4 digested with 
Ncol and Sail Recombinant baculoviruses are made and the desired recombinant C fragment 
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is expressed in S/9 cells using the protocols provided by the manufacturer (Invitrogen MaxBac 
manual). The resulting constructs will express the pHisBot protein intracellularly (including 
the N-terminal his-tag) under the control of the polyhedrin promoter. For extracellular 
secretion of botulinal C fragment proteins, the C fragment sequences from the pHisBot 
constructs arc cloned into the pMelBacB vector (Invitrogen) as described above for the 
P BlucBac4 vector. When the pMelBacB vector is employed, the his-tagged botulinal C 
fragment proteins are secreted (utilizing a vector-encoded honeybee meli.tin secretion signal) 
and contain a nine amino acid extension at the N-terminus. 

His-tagged botulinal C fragments expressed in yeast or insect cells are purified using 
metal chelation columns as described in the preceding examples. 

From the above it is clear that the present invention provides compositions and 
methods for the preparation of effective multivalent vaccines against C. botulinum neurotoxin. 
It is also contemplated that the recombinant botulinal proteins be used for the production of 
antitoxins. All publications and patents mentioned in the above specification are herein 
incorporated by reference. Various modifications and variations of the described method and 
system of the invention will be apparent to those skilled in the art without departing from the 
scope and spirit of the invention. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Williams, James A. 

Thai ley, Bruce S. 

(iii) NUMBER OF SEQUENCES : 82 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Medlen & Carroll 

(?) Ci?Y ET ^n 2 p Mont 9° mer y Street, Suite 2200 
ic; city: San Francisco 

(D) STATE: California 

If! z?r9lio4 nited Stat6S ° f 

(v) COMPUTER READABLE FORM ■ 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC - DOS /MS - DOS 

(D) SOFTWARE: Patentln Release *i.o. Version #1.30 
(vi) CURRENT APPLICATION DATA- 

(A) APPLICATION NUMBER • US 
<B) FILING DATE: 
(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION * 
(A) NAME: Carroll, Peter G 
<B) REGISTRATION NUMBER: 32 837 
(C) REFERENCE /DOCKET NUMBER • OPHD-02959 

li5£) TELECOMMUNICATION INFORMATION • 

(A) TELEPHONE: (415) 705-8410 

(B) TELEFAX: (415) 397-8338 

(2) INFORMATION FOR SEQ ID NO:l: 

<i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
GGAAATTTAG CTGCAGCATC TGAC 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH; 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE 'TYPE: DNA {genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
TCTAGCAAAT TCGCTTGTGT TGAA 
(2) INFORMATION FOR SEQ ID NO : 3 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
CTCGCATATA GCATTAGACC 
(2) INFORMATION FOR SEQ ID NO : 4 : 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
CTATCTAGGC CTAAAGTAT 
(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8133 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

I IX) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 1 . . 8130 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

ATG TCT TTA ATA TCT AAA GAA GAG TTA ATA AAA CTC GCA TAT AGC ATT 
Met Ser Leu lie Ser Lys Glu Glu Leu He Lys Leu Ala Tyr Ser lie 
15 10 15 

AGA CCA AGA GAA AAT GAG TAT AAA ACT ATA CTA ACT AAT TTA GAC GAA 
Arg Pro Arg Glu Asn Glu Tyr Lys Thr He Leu Thr Asn Leu Asp Glu 
20 25 30 

TAT AAT AAG TTA ACT ACA AAC AAT AAT GAA AAT AAA TAT TTG CAA TTA 
Tyr Asn Lys Leu Thr Thr Asn Asn Asn Glu Asn Lys Tyr Leu Gin Leu 
35 40 45 

AAA AAA CTA AAT GAA TCA ATT GAT GTT TTT ATG AAT AAA TAT AAA ACT 
Lys Lys Leu Asn Glu Ser He Asp Val Phe Met Asn Lys Tyr Lys Thr 
50 55 60 

TCA AGC AGA AAT AGA GCA CTC TCT AAT CTA AAA AAA GAT ATA TTA AAA 
Ser Ser Arg Asn Arg Ala Leu Ser Asn Leu Lys Lys Asp He Leu Lys 
65 70 75 80 

GAA GTA ATT CTT ATT AAA AAT TCC AAT ACA AGC CCT GTA GAA AAA AAT 
Glu Val lie Leu lie Lys Asn Ser Asn Thr Ser Pro Val Glu Lys Asn 
85 90 95 

TTA CAT TTT GTA TGG ATA GGT GGA GAA GTC AGT GAT ATT GCT CTT GAA 
Leu His Phe Val Trp He Gly Gly Glu Val Ser Asp lie Ala Leu Glu 
100 105 no 
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TAC ATA AAA CAA TGG GCT GAT ATT AAT GCA GAA TAT AAT ATT AAA CTG 
Tyr lie Lys Gin Trp Ala Asp lie Asn Ala Glu Tyr Asn lie Lys Leu 
115 120 125 

TGG TAT GAT AGT GAA GCA TTC TTA GTA AAT ACA CTA AAA AAG GCT ATA 
Trp Tyr Asp Ser Glu Ala Phe Leu Val Asn Thr Leu Lys Lys Ala lie 
130 135 140 

GTT GAA TCT TCT ACC ACT GAA GCA TTA CAG CTA CTA GAG GAA GAG ATT 
Val Glu Ser Ser Thr Thr Glu Ala Leu Gin Leu Leu Glu Glu Glu lie 
145 150 155 160 

CAA AAT CCT CAA TTT GAT AAT ATG AAA TTT TAC AAA AAA AGG ATG GAA 
Gin Asn Pro Gin Phe Asp Asn Met Lys Phe Tyr Lys Lys Arg Met Glu 
165 170 175 

TTT ATA TAT GAT AGA CAA AAA AGG TTT ATA AAT TAT TAT AAA TCT CAA 
Phe lie Tyr Asp Arg Gin Lys Arg Phe He Asn Tyr Tyr Lys Ser Gin 
180 185 190 

ATC AAT AAA CCT ACA GTA CCT ACA ATA GAT GAT ATT ATA AAG TCT CAT 
lie Asn Lys Pro Thr Val Pro Thr lie Asp Asp lie lie Lys Ser His 
195 200 205 

2> CTA GTA TCT GAA TAT AAT AGA GAT GAA ACT GTA TTA GAA TCA TAT AGA 

Val Ser Glu Tyr Asn Arg Asp Glu Thr Val Leu Glu Ser Tyr Arq 
- 10 215 220 

M) Thr A^n T™ *** A ? A AGT AAT CAT GGG ATA GAT ATC AGG 

Thr Asn Ser Leu Arg Lys lie Asn Ser Asn His Gly lie Asp He Arg 

230 235 240 

GCT AAT AGT TTG TTT ACA GAA CAA GAG TTA TTA AAT ATT TAT AGT CAG 
Ala Asn Ser Leu Phe Thr Glu Gin Glu Leu Leu Asn He Tyr Ser Gin 
245 250 255 

GAG TTG TTA AAT CGT GGA AAT TTA GCT GCA GCA TCT GAC ATA GTA AGA 
Glu Leu Leu Asn Arg Gly Asn Leu Ala Ala Ala Ser Asp lie Val Arg 
4() 260 265 270 

TTA TTA GCC CTA AAA AAT TTT GGC GGA GTA TAT TTA GAT GTT GAT ATG 
Leu Leu Ala Leu Lys Asn Phe Gly Gly Val Tyr Leu Asp Val Asp Met 
275 280 285 

CTT CCA GGT ATT CAC TCT GAT TTA TTT AAA ACA ATA TCT AGA CCT AGC 
Leu Pro Gly lie His Ser Asp Leu Phe Lys Thr He Ser Arg Pro Ser 

295 - - 



45 



65 



300 



AAG TAT AAA AAA TAT ATA AAT AAT TAT ACA TCA GAA AAC TTT GAT AAA 

Lys Tyr Lys Lys Tyr lie Asn Asn Tyr Thr Ser Glu Asn Phe Asp J£ 

325 330 335 

lIu A^n 7™ GAT *" ?TT *** CT ° ATT ATA GAA AGT AAA 

Leu Asp Gin Gin Leu Lys Asp Asn Phe Lys Leu lie He Glu Ser Lvs 

6() 340 34 5 350 

AGT GAA AAA TCT GAG ATA TTT TCT AAA TTA GAA AAT TTA AAT GTA TCT 

Ser Glu Lys Ser Glu He Phe Ser Lys Leu Glu Asn Leu Asn Val Ser 

355 360 365 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



912 



S() I" A 7 S? A CTA GA ° CGT TGG GAA ATG ATA ^AA TTA GAG GCT ATT ATG 960 

Ue G1 >' As P A r3 Trp Glu Met lie Lys Leu Glu Ala He Met 

305 310 315 320 



GAT CTT GAA ATT AAA ATA GCT TTC GCT TTA GGC AGT GTT ATA AAT CAA 
Asp Leu Glu lie Lys lie Ala Phe Ala Leu Gly Ser Val He Asn G^ 
J70 375 380 



1008 



1056 



1104 



1152 
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GCC TTG ATA TCA AAA CAA GGT TCA TAT CTT ACT AAC CTA GTA ATA GAA 

Ala Leu lie Ser Lys Gin Gly Ser Tyr Leu Thr Asn Leu Val lie Glu 

385 390 395 400 

CAA GTA AAA AAT AGA TAT CAA TTT TTA AAC CAA CAC CTT AAC CCA GCC 

Gin Val Lys Asn Arg Tyr Gin Phe Leu Asn Gin His Leu Asn Pro Ala 

405 410 415 



CAA GAA ATA AAT AGT CTA TGG AGC TTT GAT CAA GCA AGT GCA AAA TAT 
Gin Glu lie Asn Ser Leu Trp Ser Phe Asp Gin Ala Ser Ala Lys Tyr 
-° 515 520 525 



1200 



1248 



ATA GAG TCT GAT AAT AAC TTC ACA GAT ACT ACT AAA ATT TTT CAT GAT 1296 

W lie Glu ser Asp Asn Asn Phe Thr Asp Thr Thr Lys lie Phe His Asp 

420 425 430 

TCA TTA TTT AAT TCA GCT ACC GCA GAA AAC TCT ATG TTT TTA ACA AAA 

Ser Leu Phe Asn Ser Ala Thr Ala Glu Asn Ser Met Phe Leu Thr Lys 
13 435 440 445 

ATA GCA CCA TAC TTA CAA GTA GGT TTT ATG CCA GAA GCT CGC TCC ACA 

He Ala Pro Tyr Leu Gin Val Gly Phe Met Pro Glu Ala Arg Ser Thr 
45 ° 455 460 



1344 



1392 



ATA AGT TTA AGT GGT CCA GGA GCT TAT GCG TCA GCT TAC TAT GAT TTC 14 4 0 

lie Ser Leu Ser Gly Pro Gly Ala Tyr Ala Ser Ala Tyr Tyr Asp Phe 
465 470 475 480 

ATA AAT TTA CAA GAA AAT ACT ATA GAA AAA ACT TTA AAA GCA TCA GAT 14 88 

He Asn Leu Gin Glu Asn Thr He Glu Lys Thr Leu Lys Ala Ser Asp 
485 490 495 

TTA ATA GAA TTT AAA TTC CCA GAA AAT AAT CTA TCT CAA TTG ACA GAA 15 36 

Leu He Glu Phe Lys Phe Pro Glu Asn Asn Leu Ser Gin Leu Thr Glu 
500 505 510 



1584 



CAA TTT GAG AAA TAT GTA AGA GAT TAT ACT GGT GGA TCT CTT TCT GAA 16 3 2 

Gin Phe Glu Lys Tyr Val Arg Asp Tyr Thr Glv Gly Ser Leu Ser Glu 
530 535 ' 540 



a«« ^ T ?™ " T AAT ^ ^ CT GCC CTC GAC AAA AAC TAT 16 8 0 

^i.. * - ' * Ala Leu Asp Lys Asn Tyr 

555 560 



172B 



™* oinw in ami aai alt GCC CTC GAC AAA AAC TAT 

Asp Asn Gly Val Asp Phe Asn Lys Asn Thr Ala Leu Asp Lys Asn Tyr 
545 550 «~ 

45 TTA TTA AAT AAT AAA ATT CCA TCA AAC AAT GTA GAA GAA GCT GGA AGT 

Leu Leu Asn Asn Lys lie Pro Ser Asn Asn Val Glu Glu Ala Gly Ser 

565 570 575 

AAA AAT TAT GTT CAT TAT ATC ATA CAG TTA CAA GGA GAT GAT ATA AGT 17 76 

?U Lys Asn Tyr Val His Tyr He He Gin Leu Gin Gly Asp Asp He Ser 
580 585 590 

TAT GAA GCA ACA TGC AAT TTA TTT TCT AAA AAT CCT AAA AAT AGT ATT 1824 

Tyr Glu Ala Thr Cys Asn Leu Phe Ser Lys Asn Pro Lys Asn Ser He 

^ 595 600 605 

ATT ATA CAA CGA AAT ATG AAT GAA AGT GCA AAA AGC TAC TTT TTA AGT 18 72 

He lie Gin Arg Asn Met Asn Glu Ser Ala Lys Ser Tvr Phe Leu Ser 

61 ° 615 620 * 

GAT GAT GGA GAA TCT ATT TTA GAA TTA AAT AAA TAT AGG ATA CCT GAA 192 0 

Asp Asp Gly Glu Ser lie Leu Glu Leu Asn Lys Tyr Arg He Pro Glu 

625 630 635 640 

AGA TTA AAA AAT AAG GAA AAA GTA AAA GTA ACC TTT ATT GGA CAT GGT 196 8 

Arg Leu Lys Asn Lys Glu Lys Val Lys Val Thr Phe He Gly His Gly 

645 650 655 
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T^t ^ T ^ CTA *** GCA TCC AAT ATT CCA GGA TTA GCA 

lie Asp Asn Lys Leu Lys Ala Lys Ser Lys Asn He Pro Gly ™ Ala 

790 7 « aoo 

Ser lie Ser G^ Asn n * ?** ^ TTA ™ CTT GAT GCA AGT G " ACT 
6 Ser Glu As ? Ile L V S Thr Leu Leu Leu Asp Ala Ser Val Ser 
805 810 815 

Pro SJ 5£ IE IS A TI ™ f T CTT CTT *AT ATT GAA TCT 

Asp Thr Lys Phe Ile Leu Asn Asn Leu Lys Leu Asn lie Glu Ser 

8i0 825 830 

III A TI r? G a" I AC ATT TAT TAT GAA AAA TTA GAG CCT GTT AAA AAT 
Sei Ile Gly Asp Tyr He Tyr Tyr Glu Lys Leu Glu Pro Val Lys fttn 

s is sj s is s: 2! 15? 2! S £ s; ™ £ K 21 

S! £ S JS E S SJ ^ ™ S S i£ IS S 

885 890 895 

Thr Ivr sfl vI A AGA IP ATT AAC AAA AGT AAT GGT GAG TCA GTT TAT 
Thr Tyr Ser Val Arg Phe lie Asn Lys Ser Asn Gly Glu Ser Val Tyr 

905 910 

s sj s sk ^ s; ?n si s $ ™ ~ sk se s s 



2016 



2064 



AAA GAT GAA TTC AAC ACA AGC GAA TTT GCT AGA TTA AGT GTA GAT TCA 
Lys Asp Glu Phe Asn Thr Ser Glu Phe Ala Arg EI Ser vll AsJ sSr 
660 665 6 7 0 

CTT TCC AAT GAG ATA AGT TCA TTT TTA GAT ACC ATA AAA TTA GAT ATA 
Leu ser Asn Glu lie Ser Ser Phe Leu Asp Thr lie i£ ™ Asp lie 

TCA CCT AAA AAT GTA GAA GTA AAC TTA CTT GGA TGT AAT a TP ttt B1 ~r 
Ser Pro Lys Asn Val Glu Val Asn Leu Uel lyl t£ Met III J£ 

695 700 

ICr A A n pP AAT ?71 GAA GAA ACT TAT CCT GGG AAG TTG CTA TTA AGT 
Tyr Asp Phe Asn Val Glu Glu Thr Tyr Pro Gly Lys Leu Leu Leu Sei 

715 720 

ill f C f** A T T ACT TCC ACT TTA CCT G AT GTA AAT AAA AAT TCT 

He Met Asp Lys lie Thr Ser Thr Leu Pro Asp Val Asn 2JJ £1 III 

725 7 30 735 

ill ?£ Ue Ala f T °^ ° TA AGA ATT AAT AGT GAG GGA 

lie Thr Ile Gly Ala Asn Gin Tyr Glu Val Arg lie Asn Ser Glu Gly 

745 750 

AGA AAA GAA CTT CTG GCT CAC TCA GGT AAA TGG ATA AAT AAA GAA GAA 
Aig Lys Glu Leu Leu Ala His Ser Gly Lys Trp He A^n Lys G^ G^ 

760 765 

Til ill mI? c GC n AT TTA TCT AGT GAA TAC TTT TTT GAT TCT 

Ma lie Met Ser Asp Leu Ser Ser Lys Glu Tyr lie Phe Phe Asp Ser 



2160 



2208 



2256 



2304 



2352 



2400 



2448 



2496 



2544 



2592 



2640 



2686 



2736 
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TCA TTA TCA ATT TAT TCT GCA ATA GGT ATA GAA ACA GAA AAT CTA GAT 
Ser Leu Ser lie Tyr Ser Ala lie Gly lie Glu Thr Glu Asn Leu Asp 
1205 1210 12 15 

TTT TCA AAA AAA ATA ATG ATG TTA CCT AAT GCT CCT TCA AGA GTG TTT 
Phe Ser Lys Lys He Mec Met Leu Pro Asn Ala Pro Ser Arg val Phe 
1220 1225 12I0 

TGG TGG GAA ACT GGA GCA GTT CCA GGT TTA AGA TCA TTG GAA AAT GAC 
Trp Trp Glu Thr Gly Ala Val Pro Gly Leu Arg Ser Leu Glu Asn Asp 
12 35 1240 !245 * 

r?v Th T 7™ CTT GAT TCA ATA AGA GAT TTA TAC C CA GGT AAA TTT 

Gly Thr Arg Leu Leu Asp Ser He Arg Asp Leu Tyr Pro Gly Lys Phe 
1 su 1255 1260 

TAC TGG AGA TTC TAT GCT TTT TTC GAT TAT GCA ATA ACT ACA TTA AAA 
Tyr Trp Arg Phe Tyr Ala Phe Phe Asp Tyr Ala lie Thr Thr leu L^ 

1270 1275 1280 

CCA GTT TAT GAA GAC ACT AAT ATT AAA ATT AAA CTA GAT AAA GAT APT 
Pro val Tyr Glu Asp Thr Asn lie Lys He Lys Leu Asp L^ ts P Thr 
1285 1290 12 g 5 

AGA AAC TTC ATA ATG CCA ACT ATA ACT ACT AAC GAA ATT AGA AAC AAA 
Arg Asn Phe lie Mec Pro Thr He Thr Thr Asn Glu ill Arg j£S Jtt 
1300 1305 13 io 

TTA TCT TAT TCA TTT GAT GGA GCA GGA GGA ACT TAC TCT TTA TTA TTA 

Leu Ser Tyr Ser Phe Asp Gly Ala Gly Gly Thr Tyr Ser Leu Leu llu 
13 1 5 1320 !325 

Ipr w I* T o CA rT A TCA ACG AAT ATA TTA TCT AAA GAT GAT TTA 

Ser Se^lvr Pro He Ser Th^Asn He Asn Leu Ser Lys Asp Asp Leu 

r'rn n A ll T AAT A T T GAT AAT GAA GTA AGA GAA ATA TCT ATA GAA AAT 

1345 6 ASn 116 A ^ P ASn Glu Val Ar ^ G1 " He Ser He Glu Asn 

1350 1355 136Q 

GGT ACT ATT AAA AAA GGA AAG TTA ATA AAA GAT GTT TTA ACT AAA ATT 
Gly Thr He Lys Lys Gly Lys Leu He Lys Asp Val Leu Ser Lys lie 
1365 1370 1375 

A A D t! A ^1 f T ^ CTT ATT ATA GGC AAT CAA ACA ATA GAT TTT 

A.p lie Asn Lys Asn Lys Leu lie He Gly Asn Gin Thr He Asp Phe 
1380 1385 1390 

Ser Glv A A n tT? ^ ^ GAT AGA TAT ATA TTC TTG ACT TGT GAG 

Y A fL Ile ASP Asn Lys As ? Ar 9 T V r He Phe Leu Thr Cys Glu 
XJab 1400 1405 

TTA GAT GAT AAA ATT AGT TTA ATA ATA GAA ATA AAT CTT GTT GCA AAA 
Leu Asp Asp Lys He Ser Leu lie He Glu He Asn Leu Val Ala 

1HiU 1415 I4 20 

I" Tvl S« Jl? I™ I™ TCT GGG GAT *** TAT TTG ATA TCC AAT 

Ser Tyi Se, Leu Leu Leu Ser Gly Asp Lys Asn Tyr Leu lie Ser Asn 

1430 1435 1440 

TTA TCT AAT ACT ATT GAG AAA ATC AAT ACT TTA GGC CTA GAT AGT AAA 
Leu ser Asn Thr lie Glu Lys He Asn Thr Leu Gly Leu Asp Ser J£J 
1445 1450 1455 i 
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AAT ATA GCG TAC AAT TAC ACT GAT GAA TCT AAT AAT AAA TAT TTT GGA 
Asn He Ala Tyr Asn Tyr Thr Asp Glu Ser Asn Asn Lys Tyr Phe Gly 
1460 1465 1470 

GCT ATA TCT AAA ACA AGT CAA AAA AGC ATA ATA CAT TAT AAA AAA GAC 
Ala He Ser Lys Thr Ser Gin Lys Ser lie He His Tyr Lys Lys Asp 
1475 1480 1485 



4416 



4464 



4560 



4608 



4656 



4704 



AGT AAA AAT ATA TTA GAA TTT TAT AAT GAC AGT ACA TTA GAA TTT AAC 4 512 

M> Ser Lys Asn He Leu Glu Phe Tyr Asn Asp Ser Thr Leu Glu Phe Asn 
1490 1495 1500 

AGT AAA GAT TTT ATT GCT GAA GAT ATA AAT GTA TTT ATG AAA GAT GAT 
Ser Lys Asp Phe He Ala Glu Asp He Asn Val Phe Met Lys Asp Asn 
^ 1S05 1510 1515 1520 

ATT AAT ACT ATA ACA GGA AAA TAC TAT GTT GAT AAT AAT ACT GAT AAA 
He Asn Thr lie Thr Gly Lys Tyr Tyr Val Asp Asn Asn Thr Asp Lys 
2() 1525 "30 1535 

AGT ATA GAT TTC TCT ATT TCT TTA GTT AGT AAA AAT CAA GTA AAA GTA 
Ser He Asp Phe Ser lie Ser Leu Val Ser Lys Asn Gin Val Lys Val 
1540 1545 1550 

25 AAT GGA TTA TAT TTA AAT GAA TCC GTA TAC TCA TCT TAC CTT GAT TTT 

Asn Gly Leu Tyr Leu Asn Glu Ser Val Tyr Ser Ser Tyr Leu Asp Phe 
1555 1560 1565 

GTG AAA AAT TCA GAT GGA CAC CAT AAT ACT TCT AAT TTT ATG AAT TTA 47 52 

Mi Jal Lys Asn Ser Asp Gly His His Asn Thr Ser Asn Phe Met Asn Leu 

1570 1575 1580 

TTT TTG GAC AAT ATA AGT TTC TGG AAA TTG TTT GGG TTT GAA AAT ATA 4 8 00 

Phe Leu Asp Asn He Ser Phe Trp Lys Leu Phe Gly Phe Glu Asn He 
•° 1585 1590 1595 1600 

AAT TTT GTA ATC GAT AAA TAC TTT ACC CTT GTT GGT AAA ACT AAT CTT 4 84 8 

Asn Phe Val He Asp Lys Tyr Phe Thr Leu Val Gly Lys Thr Asn Leu 
1605 1610 1615 

GGA TAT GTA GAA TTT ATT TGT GAC AAT AAT AAA AAT ATA GAT ATA TAT 48 96 

Gxy Tyr Val Glu Phe He Cys Asp Asn Asn Lys Asn He Asp lie Tyr 
1^20 1625 1630 

45 TTT GGT GAA TGG AAA ACA TCG TCA TCT AAA AGC ACT ATA TTT AGC GGA 4 944 

Phe Gly Glu Trp Lys Thr Ser Ser Ser Lys Ser Thr He Phe Ser Gly 
1635 1640 1645 

AAT GGT AG A AAT GTT GTA GTA GAG CCT ATA TAT AAT CCT GAT ACG GGT 4 9 92 

Asn Gly Arg Asn Val Val Val Glu Pro He Tyr Asn Pro Asp Thr Gly 
1650 1655 1660 

GAA GAT ATA TCT ACT TCA CTA GAT TTT TCC TAT GAA CCT CTC TAT GGA 504 0 

Glu Asp He Ser Thr Ser Leu Asp Phe Ser Tyr Glu Pro Leu Tyr Gly 
" 1665 1670 1675 1680 



ATA GAT AGA TAT ATA AAT AAA GTA TTG ATA GCA CCT GAT TTA TAT ACA 5 08B 

lie Asp Arg Tyr lie Asn Lys Val Leu lie Ala Pro Asp Leu Tyr Thr 
1685 1690 1695 

AGT TTA ATA AAT ATT AAT ACC AAT TAT TAT TCA AAT GAG TAC TAC CCT 5136 

Ser Leu He Asn lie Asn Thr Asn Tyr Tyr Ser Asn Glu Tyr Tyr Pro 
1700 1705 1710 

GAG ATT ATA GTT CTT AAC CCA AAT ACA TTC CAC AAA AAA GTA AAT ATA 5184 

Glu He lie Val Leu Asn Pro Asn Thr Phe His Lys Lys Val Asn He 
1715 1720 1725 
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1885 

Gil SI? m T ? ^ G TTG GGA GTA TTT GGA CCT GAT GGA TTT GAA TAT 

iHSb 1900 

TTT GCA CCT GCC AAT ACT CAA AAT AAT AAC ATA GAA GGT rrc lta 
Phe AL Pro Ala Asn Thr Gin Asn A S n Asn He G^ SJ All He 

1910 1915 1920 

GTT TAT CAA AGT AAA TTC TTA ACT TTG AAT GGC AAA AAA TAT TAT TTT 
val Tyr Gln S er Lys Phe Leu Thr Leu Asn Gly L^ Lys Tyr ?£ ^ 
1925 1"0 1935 

ttl ^ T ^ C TCA *** GCA GTC ACT GQ A TGG AGA ATT ATT AAC AAT GAG 
Asp Asn Asn Ser Lys Ala Val Thr Gly Trp Arg He 111 ™ Glu 
1940 1945 1950 

AAA TAT TAC TTT AAT CCT AAT AAT GCT ATT GCT GCA GTC CCA ttp ra ? 
Lys Tyr Tyr^Phe Asn Pro Asn AanAl. lie All Ala Vainly lIu g^ 

GTA ATT GAC AAT AAT AAG TAT TAT TTC AAT CCT GAC ACT GCT ATC ATC 

1970 P ^ ^ T^/^ PhS ASn Pr ° As ? Thr Ala Ilo lie 

1975 1980 

TCA AAA GGT TGG CAG ACT GTT AAT GGT AGT AGA TAC TAC TTT GAT apt 
Ser s Lys Gly Trp Gln Thr Val Asn Gly Ser Arg Tyr lyr III 2J Thr 

!995 2000 



5232 



5280 



ten lit Itl c GT T CT TCT TTT GAG TAT AAA TGG TCT ACA GAA GGA AGT 
Asn Leu Asp Ser Ser Ser Phe Glu Tyr Lys Trp Ser Thr Glu Gly Ser 

1735 1740 

A«n 17 tT T T A GTT AGA TAC TTA GAA GAA AGT AAT AAA AAA ATA TTA 
Asp Phe lie Leu Val Arg Tyr Leu Glu Glu Ser Asn Lys Lys lie ™ 

1750 1755 1760 

g£ tT* AGA A T C *** GGT ATC TTA TCT AAT ACT CAA TCA TTT AAT 533 « 

Gln Lys He Arg lie Lys Gly lie Leu Ser Asn Thr Gln Ser Phe Asn 
1765 1770 1775 

AAA ATG AGT ATA GAT TTT AAA GAT ATT AAA AAA CTA TCA TTA GGA TAT 
Lys Met Ser He Asp Phe Lys Asp He Lys Lys Leu Ser leu GlJ lyl 
1780 1785 1790 y y 

ATA ATG AGT AAT TTT AAA TCA TTT AAT TCT GAA AAT rjs tt& p»t *™ 
Ue Met Ser Asn Phe Lys Ser Phe Asn S SI H» SI Zel £J Arg 
1,95 1800 1805 ^ s 

GAT CAT TTA GGA TTT AAA ATA ATA GAT AAT AAA ACT TAT TAC TAT CAT 
Asp H,s Leu Gly Phe Lys lie He Asp Asn Lys Thr ?£ Tyr £J 

J- "is 1B20 

GAA GAT AGT AAA TTA GTT AAA GGA TTA ATC AAT ATA AAT AAT TCA TTA «,„ 
GluA.p ser Lys Leu Val^ys Gly Leu He As^Ile JS £n ler ™ 

Phf t AT IP GAT ° CT ATA GAA TTT AAC TTA GTA ACT GGA TGG CAA ACT 
Phe Tyr Phe Asp Pro He Glu Phe Asn Leu Val Thr Gly Trp Gln tS 
1845 1850 1855 

lie El *** *** TAT TAT TTT GAT ATA AAT ACT GGA GCA GCT TTA 

He Asn Gly Lys Lys Tyr Tyr Phe Asp He Asn Thr Gly Ala Ala Leu 
I860 1865 187Q 

ACT AGT TAT AAA ATT ATT AAT GGT AAA CAC TTT TAT TTT AAT AAT GAT 
Thr Ser Tyr Lys He He Asn Gly Lys His Phe Tyr pK HI AsJ 

ltWb 1880 lope K 



5376 



5424 



5472 



5568 



5616 



5664 



571: 



5760 



5808 



5856 



5904 
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GAT ACC GCT ATT GCC TTT AAT GGT TAT AAA ACT ATT GAT GGT AAA CAC 6 04 8 

Asp Thr Ala lie Ala Phe Asn Gly Tyr Lys Thr He Asp Gly Lys His 
2005 2010 2015 

TTT TAT TTT GAT AGT GAT TGT GTA GTG AAA ATA GGT GTG TTT AGT ACC 6 0 96 

Phe Tyr Phe Asp Ser Asp Cys Val Val Lys He Gly Val Phe Ser Thr 
2020 2025 2030 

TCT AAT GGA TTT GAA TAT TTT GCA CCT GCT AAT ACT TAT AAT AAT AAC 6144 
1U Ser Asn Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Tyr Asn Asn Asn 
2035 2040 2045 

ATA GAA GGT CAG GCT ATA GTT TAT CAA AGT AAA TTC TTA ACT TTG AA' r 6192 
lie Glu Gly Gin Ala lie Val Tyr Gin Ser Lys Phe Leu Thr Leu Asn 
i:> 2050 2055 2060 

GGT AAA AAA TAT TAC TTT GAT AAT AAC TCA AAA GCA GTT ACC GGA TTG 624 0 

Gly Lys Lys Tyr Tyr Phe Asp Asn Asn Ser Lys Ala Val Thr Gly Leu 
1Q 2065 2070 2075 2080 

CAA ACT ATT GAT AGT AAA AAA TAT TAC TTT AAT ACT AAC ACT GCT GAA 62 8 8 

Gin Thr He Asp Ser Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Glu 
2085 2090 2095 

2> GCA GCT ACT GGA TGG CAA ACT ATT GAT GGT AAA AAA TAT TAC TTT AAT 6 3 36 

Ala Ala Thr Gly Trp Gin Thr He Asp Gly Lys Lys Tyr Tyr Phe Asn 
2100 2105 ' 2110 

ACT AAC ACT GCT GAA GCA GCT ACT GGA TGG CAA ACT ATT GAT GGT AAA 6 3 84 

M) rhr Asn Thr Ala Glu Ala Ala Thr Gly Trp Gin Thr He Asp Gly Lys 

2115 2120 2125 

AAA TAT TAC TTT AAT ACT AAC ACT GCT ATA GCT TCA ACT GGT TAT ACA 64 32 

Lys Tyr Tyr Phe Asn Thr Asn Thr Ala He Ala Ser Thr Gly Tyr Thr 
-° -130 2135 2140 



40 



ATT ATT AAT GGT AAA CAT TTT TAT TTT AAT ACT GAT GGT ATT ATG CAG 64 8 0 

He lie Asn Gly Lys His Phe Tyr Phe Asn Thr Asp Gly lie Met Gin 

2145 2150 2155 2160 

ATA GGA GTG TTT AAA GGA CCT AAT GGA TTT GAA TAT TTT GCA CCT GCT 6 52 8 

lie Gly Val Phe Lys Gly Pro Asn Gly Phe Glu Tyr Phe Ala Pro Ala 
2165 2170 2175 

4? AAT ACG GAT GCT AAC AAC ATA GAA GGT CAA GCT ATA CTT TAC CAA AAT 6 5 76 

Asn Thr Asp Ala Asn Asn He Glu Gly Gin Ala lie Leu Tyr Gin Asn 
2180 2185 2190 



50 



60 



h5 



GAA TTC TTA ACT TTG AAT GGT AAA AAA TAT TAC TTT GGT AGT GAC TCA 66 24 

Glu Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tvr Phe Gly Ser Asp Ser 
2195 2200 2205 

AAA GCA GTT ACT GGA TGG AG A ATT ATT AAC AAT AAG AAA TAT TAC TTT 66 72 

Lys Ala Val Thr Gly Trp Arg He He Asn Asn Lys Lys Tyr Tyr Phe 
2210 2215 2220 

AAT CCT AAT AAT GCT ATT GCT GCA ATT CAT CTA TGC ACT ATA AAT AAT 6 720 

Asn Pro Asn Asn Ala He Ala Ala He His Leu Cys Thr lie Asn Asn 
2225 2230 2235 2240 

GAC AAG TAT TAC TTT AGT TAT GAT GGA ATT CTT CAA AAT GGA TAT ATT 6 76 8 

Asp Lys Tyr Tyr Phe Ser Tyr Asp Gly He Leu Gin Asn Gly Tyr He 
2245 2250 2255 

ACT ATT GAA AGA AAT AAT TTC TAT TTT GAT GCT AAT AAT GAA TCT AAA 6 816 

Thr lie Glu Arg Asn Asn Phe Tyr Phe Asp Ala Asn Asn Glu Ser Lys 
2260 2265 2270 



- 228 - 



WO 98/08540 PCT/US97/15394 



15 



40 



45 



f>0 



65 



GTA ACA GGA GTA TTT AAA GGA CCT AAT GGA TTT GAG TAT TTT GCA 
Met Val Thr Gly Val Phe Lys Gly Pro Asn Gly Phe Glu Tyr Phe Ala 
2275 2280 2285 

CCT GCT AAT ACT CAC AAT AAT AAC ATA GAA GGT CAG GCT ATA GTT TAC 
Pro Ala Asn Thr His Asn Asn Asn lie Glu Gly Gin Ala lie Val Tyr 
2290 2295 2300 

CAG AAC AAA TTC TTA ACT TTG AAT GGC AAA AAA TAT TAT TTT GAT AAT 
Gin Asn Lys Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe Asp Asn 
■* 2310 2315 2320 

GAC TCA AAA GCA GTT ACT GGA TGG CAA ACC ATT GAT GGT AAA AAA TAT 
Asp Ser Lys Ala Val Thr Gly Trp Gin Thr lie Asp Glv Lys Lys Tyr 
2325 2330 ' 2 335 

TAC TTT AAT CTT AAC ACT GCT GAA GCA GCT ACT GGA TGG CAA ACT ATT 
*yr Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr Gly Trp Gin Thr lie 
2340 2345 2350 

GAT GGT AAA AAA TAT TAC TTT AAT CTT AAC ACT GCT GAA GCA GCT ACT 
Asp Gly Lys Lys Tyr Tyr Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr 
-355 2360 2365 

GGA TGG CAA ACT ATT GAT GGT AAA AAA TAT TAC TTT AAT ACT AAC ACT 
oly Trp Gin Thr lie Asp Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr 
-370 2375 2380 



GGA TTT GAA TAC TTT GCA CCT GCT AAT ACG GAT GCT AAC AAC ATA GAA 
Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn lie Glu 
2420 2425 2430 

GGT CAA GCT ATA CTT TAC CAA AAT AAA TTC TTA ACT TTG AAT GGT AAA 
Gly Gin Ala lie Leu Tyr Gin Asn Lys Phe Leu Thr Leu Asn Glv Lvs 
243 5 2440 2445 

AAA TAT TAC TTT GGT AGT GAC TCA AAA GCA GTT ACC GGA CTG CCA ACT 
,//s Tyr Tyr Phe Gly Ser Asp Ser Lys Ala Val Thr Gly Leu Arq Thr 
- 450 2455 2460 



ACT GGA TGG CAA ACT ATT AAT GGT AAA AAA TAC TAC TTT AAT ACT AAC 
Thr Gly Trp Gin Thr lie Asn Gly Lys Lys Tyr Tyr Phe Asn Thr Asn 
24 85 2490 2 495 

ACT TCT ATA GCT TCA ACT GGT TAT ACA ATT ATT AGT GGT AAA CAT TTT 
Thr Ser He Ala Ser Thr Gly Tyr Thr lie lie Ser Gly Lys His Phi 
2500 2505 2510 

TAT TTT AAT ACT GAT GGT ATT ATG CAG ATA GGA GTG TTT AAA GGA CCT 
Tyr Phe Asn Thr Asp Gly lie MeC Gin lie Gly Val Phe Lys Gly Pro 
*515 2520 2525 

GAT GGA TTT GAA TAC TTT GCA CCT GCT AAT ACA GAT GCT AAC AAT ATA 
Asp G y Phe Glu Tyr Phe Ala Pro Ala Asn Thr Asp Ala Asn UJ lie 
^3->u 2535 2540 



6864 



6911 



6960 



7008 



7056 



7104 



7152 



M) aZ tT ^ I CA A 5 T GGT TAT ACA AGT ATT AAT GGT ^ CA T TTT TAT 7200 

Phe He Ala Ser Thr Gly Tyr Thr Ser lie Asn Gly Lys His Phe Tyr 

* 2QS 2390 2395 2400 

TTT AAT ACT GAT GGT ATT ATG CAG ATA GGA GTG TTT AAA GGA CCT AAT 
Phe Asn Thr Asp Gly lie Met Gin lie Gly Val Phe Lvs Gly Pro Asn 
2405 2410 * 2415 



7248 



7296 



7344 



7392 



SO iTI £1 ^ TAT TAC TTT AAT ACT AAC ACT GCT GTT GCA GTT 7440 

lie Asp Glv Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Val Ala Val 
2470 2475 2 480 



7488 



7536 



7584 
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- 229 - 



WO 98/08540 PCT/US97/ 15394 



20 



40 



50 



60 



65 



70 



GAA GGT CAA GCT ATA CGT TAT CAA AAT AGA TTC CTA TAT TTA CAT GAC 
Glu Gly Gin Ala lie Arg Tyr Gin Asn Arg Phe Leu Tyr Leu His Asp 
~ bqb 25 50 2555 2560 

AAT ATA TAT TAT TTT GGT AAT AAT TCA AAA GCG GCT ACT GGT TGG GTA 
Asn lie Tyr Tyr Phe Gly Asn Asn Ser Lys Ala Ala Thr Gly Trp Val 
2565 2570 2575 



GCG AAT GGT TAT AAA ACT ATT GAT AAT AAA AAT TTT TAC TTT AGA AAT 
Ala Asn Gly Tyr Lys Thr lie Asp Asn Lys Asn Phe Tyr Phe Arg Asn 
2595 2600 2605 

GGT TTA CCT CAG ATA GGA GTG TTT AAA GGG TCT AAT GGA TTT GAA TAC 
Gly Leu Pro Gin lie Gly Val Phe Lys Gly Ser Asn Gly Phe Glu Tyr 
2610 2615 2620 

TTT GCA CCT GCT AAT ACG GAT GCT AAC AAT ATA GAA GGT CAA GCT ATA 
Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn lie Glu Gly Gin Ala lie 
•^ 25 2630 2635 2 640 

CGT TAT CAA AAT AGA TTC CTA CAT TTA CTT GGA AAA ATA TAT TAC TTT 
Arg Tyr Gin Asn Arg Phe Leu His Leu Leu Glv Lys He Tyr Tvr Phe 
2645 2650 * 2655 



01 A TmT TAC TTT ATG CCT GAT ACT GCT ATG GCT GCA GCT GGT GGA CTT 
Val Tyr Tyr Phe Met Pro Asp Thr Ala Met Ala Ala Ala Gly Gly Leu 
2675 2680 2685 

TTC GAG ATT GAT GGT GTT ATA TAT TTC TTT GGT GTT GAT GGA GTA AAA 
Phe Glu lie Asp Gly Val lie Tyr Phe Phe Gly Val Asp Gly Val Lys 
2690 2695 2700 

GCC CCT GGG ATA TAT GGC TAA 
Ala Pro Gly He Tyr Gly 
2705 2710 



4:> (2) INFORMATION FOR SEQ ID NO : 6 : 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2710 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ser Leu lie Ser Lys Glu Glu Leu He Lys Leu Ala Tyr Ser lie 
1 5 io 15 

Arg Pro Arg Glu Asn Glu Tyr Lys Thr lie Leu Thr Asn Leu Asp Glu 
20 25 3 o 

Tyr Asn Lys Leu Thr Thr Asn Asn Asn Glu Asn Lys Tyr Leu Gin Leu 
35 40 45 

Lys Lys Leu Asn Glu Ser He Asp Val Phe Met Asn Lys Tyr Lys Thr 
50 55 60 

Ser Ser Arg Asn Arg Ala Leu Ser Asn Leu Lys Lys Asp lie Leu Lys 
65 7 ° 75 80 



7680 



7728 



10 ThI tT a T S? T AGA TAT TAC TTC GAG CCT ACA GCT ATG GGT 7776 

I" Thr lie Asp Gly Asn Arg Tyr Tyr Phe Glu Pro Asn Thr Ala Met Gly 

258 ° 2585 2590 



7824 



787: 



7920 



7968 



M) * ^ ° A *** GCA GTT ACT GGA TGG CAA ACT ATT ^T GGT AAA 8016 

M) ojy Asn Asn Ser Lys Ala Val Thr Gly Trp Gin Thr lie Asn Gly Lvs 

2660 2665 2670 



8064 



8112 



8133 
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Glu Val He Leu He Lys Asn Ser Asn Thr Ser Pro Val Glu Lys Asn 
85 90 95 

Leu His Phe Val Trp He Gly Gly Glu Val Ser Asp He Ala Leu Glu 
100 105 no 

Tyr He Lys Gin Trp Ala Asp He Asn Ala Glu Tyr Asn He Lys Leu 
115 120 125 

Trp Tyr Asp Ser Glu Ala Phe Leu Val Asn Thr Leu Lys Lys Ala He 
130 135 140 

Val Glu Ser Ser Thr Thr Glu Ala Leu Gin Leu Leu Glu Glu Glu lie 

15 150 155 160 

Gin Asn Pro Gin Phe Asp Asn Met: Lys Phe Tvr Lvs Lys Arg M*t Glu 
165 170 175 

Phe He Tyr Asp Arg Gin Lys Arg Phe He Asn Tyr Tyr Lys Ser Gin 
160 185 190 

lie Asn Lys Pro Thr Val Pro Thr He Asp Asp He He Lys Ser His 
195 200 205 

Leu Val Ser Glu Tyr Asn Arg Asp Glu Thr Val Leu Glu Ser Tvr Ara 
210 215 220 " 

Thr Asn Ser Leu Arg Lys He Asn Ser Asn His Gly lie Asn lie Arq 

Ala Asn Ser Leu Phe Thr Glu Gin Glu Leu Leu Asn He Tyr Ser Gin 
245 250 255 

Glu Leu Leu Asn Arg Gly Asn Leu Ala Ala Ala Ser Asp He Val Arg 
260 265 2 70 

Leu Leu Ala Leu Lys Asn Phe Gly Gly Val Tyr Leu Asp Val Asd Met 
275 280 285 

Leu Pro Gly He His Ser Asp Leu Phe Lys Thr He Ser Arq Pro Ser 
290 295 300 

Ser He Gly Leu Asp Arg Trp Glu Met He Lvs Leu Glu Ala lie Met 
305 310 315 320 

Lys Tyr Lys Lys Tyr He Asn Asn Tyr Thr Ser Glu Asn Phe Asp Lvs 
32 5 330 335 * 

Leu Asp Gin Gin Leu Lys Asp Asn Phe Lys Leu lie He Glu Scr Lvs 
340 350 

Ser Glu Lys Ser Glu He Phe Ser Lys Leu Glu Asn Leu Asn Val Ser 
355 360 3 6 5 

Asp Leu Glu He Lys He Ala Phe Ala Leu Gly Ser Val He Asn Gin 
370 375 380 

Ala Leu He Ser Lys Gin Gly Ser Tyr Leu Thr Asn Leu Val He Glu 
385 390 395 ,00 

Gin Val Lys Asn Arg Tyr Gin Phe Leu Asn Gin His Leu Asn Pro Ala 
4 °5 410 415 

lie Glu Ser Asp Asn Asn Phe Thr Asp Thr Thr Lys He Phe His Asd 
420 425 430 

Ser Leu Phe Asn Ser Ala Thr Ala Glu Asn Ser Met Phe Leu Thr Lv- 
435 440 445 
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lie Ala Pro Tyr Leu Gin Val Gly Phe Met Pro Glu Ala Arg ser Thr 
450 455 460 

lie Ser Leu Ser Gly Pro Gly Ala Tyr Ala Ser Ala Tyr Tyr Asp Phe 
465 470 475 480 

lie Asn Leu Gin Glu Asn Thr lie Glu Lys Thr Leu Lvs Ala Ser Asp 
485 490 ' 495 

Leu lie Glu Phe Lys Phe Pro Glu Asn Asn Leu Ser Gin Leu Thr Glu 
50° 505 510 

Gin Glu He Asn Ser Leu Trp Ser Phe Asp Gin Ala Ser Ala Lvs Tvr 
515 520 525 

Gin Phe Glu Lys Tyr Val Arg Asp Tyr Thr Gly Gly Ser Leu Ser Glu 
530 535 540 

Asp Asn Gly Val Asp Phe Asn Lys Asn Thr Ala Leu Asp Lys Asn Tyr 
545 550 555 560 

Leu Leu Asn Asn Lys He Pro Ser Asn Asn Val Glu Glu Ala Gly Ser 
565 570 575 

Lys Asn Tyr Val His Tyr lie He Gin Leu Gin Glv Asp Asp H» Ser 
SB0 585 * 590 

Tyr GJ.u Ala Thr Cys Asn Leu Phe . Ser Lys Asn Pro Lys Asn Ser He 

600 6 o5 

He He Gin Arg Asn Met Asn Glu Ser Ala Lys Ser Tyr Phe Leu Ser 
Gi0 615 6 20 

Asp Asp Gly Glu Ser He Leu Glu Leu Asn Lys Tyr Arg He Pro Glu 
625 6 ^0 635 64 o 

Arg Leu Lys Asn Lys Glu Lys Val Lys Val Thr Phe lie Gly His Gly 
645 650 655 

Lys Asp Glu Phe Asn Thr Ser Glu Phe Ala Arg Leu Ser Val Asp Ser 

665 6 70 

Leu Ser Asn Glu He Ser Ser Phe Leu Asp Thr He Lvs Leu Asd He 
67 $ 680 685 

S«r Pro Lys Asn Val Glu Val Asn Leu Leu Glv Cys Asn M-r Ph* s^r 
690 695 * 700 

Tyr Asp Phe Asn Val Glu Glu Thr Tyr Pro Gly Lys Leu Leu Leu Ser 
7 °5 710 71S 



72 0 



He Met Asp Lys He Thr Ser Thr Leu Pro Asp Val Asn Lys Asn Ser 
725 730 735 

He Thr lie Gly Ala Asn Gin Tyr Glu Val Arg He Asn Ser Glu Gly 
74 0 745 750 

Arg Lys Glu Leu Leu Ala His Ser Gly Lys Trp He Asn Lvs Glu Glu 
755 760 765 ' 

Ala He Met Ser Asp Leu Ser Ser Lys Glu Tyr He Phe Phe Asd Ser 
770 775 780 

M lit ASP Acn LyS LeU LyS Aia Lys Ser L V S Asn 11* Pro Gly Leu Ala 

() " 785 7 ^0 795 8 oo 

Ser He Ser Glu Asp He Lys Thr Leu Leu Leu Asp Ala Ser Val Ser 
805 810 815 
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Pro Asp Thr Lys Phe lie Leu Asn Asn Leu Lys Leu Asn He Glu Ser 
820 825 830 

Ser lie Gly Asp Tyr He Tyr Tyr Glu Lys Leu Glu Pro Val Lys Asn 
835 840 845 

He lie His Asn Ser He Asp Asp Leu He Asp Glu Phe Asn Leu Leu 
850 855 860 

Glu Asn Val Ser Asp Glu Leu Tyr Glu Leu Lys Lys Leu Asn Asn Leu 
865 870 875 880 

Asp Glu Lys Tyr Leu He Ser Phe Glu Asp He Ser Lys Asn Asn Ser 
885 890 895 

Thr Tyr Ser Val Arg Phe He Asn Lys Ser Asn Gly Glu Ser Val Tyr 
900 905 93.0 

Val Glu Thr Glu Lys Glu lie Phe Ser Lys Tyr Ser Glu His lie Thr 
915 920 925 

Lys Glu He Ser Thr lie Lys Asn Ser lie lie Thr Asp Val Asn Gly 
930 935 940 

Asn Leu Leu Asp Asn Lie Gin Leu Asp His Thr Ser Gin Val Asn Thr 
945 950 955 960 

Leu Asn Ala Ala Phe Phe lie Gin Ser Leu He Asp Tyr Ser Ser Asn 
965 970 975 

Lys Asp Val Leu Asn Asp Leu Ser Thr Ser Val Lys Val Gin Leu Tyr 
980 985 990 

Ala Gin Leu Phe Ser Thr Gly Leu Asn Thr He Tvr Asp Ser He Gin 
•° 995 1000 * 10 05 

Leu val Asn Leu lie Ser Asn Ala Val Asn Asp Thr He Asn Val Leu 
1010 1015 1020 

40 Pro Thr lie Thr Glu Gly He Pro He Val Ser Thr He Leu Asp Glv 

102 * 1030 1035 xoio 

lie Asn Leu Gly Ala Ala He Lys Glu Leu Leu Asp Glu His Asp Pro 
10^5 1050 1055 

Leu Leu Lys Lys Glu Leu Glu Ala Lys Val Gly Val Leu Ala He Asn 
1060 1065 1070 

Met Ser Leu Ser lie Ala Ala Thr Val Ala Ser He Val Gly He Glv 
- U 1075 1080 1085 

Ala Glu Val Thr lie Phe Leu Leu Pro lie Ala Gly He Ser Ala Glv 
1°90 1095 iioo 

He Pro Ser Leu Val Asn Asn Glu Leu lie Leu His Asp Lys Ala Thr 
1105 HIO 1115 U20 

Ser Val Val Asn Tyr Phe Asn His Leu Ser Glu Ser Lvs Lys Tyr Gly 
H25 H30 * 1135 

Pro Leu Lys Thr Glu Asp Asp Lys He Leu Val Pro lie Asp Asp Leu 
H40 H45 1150 

Val lie Ser Glu lie Asp Phe Asn Asn Asn Ser He Lvs Leu Glv Thr 
0? 1155 H60 ii 6 5 

Cys Asn lie Leu Ala Met Glu Gly Gly Ser Gly His Thr Val Thr Glv 
H70 H75 H80 
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Asn lie Asp His Phe Phe Ser Ser Pro Ser lie Ser Ser His lie Pro 

1190 1195 1200 

Ser Leu Ser U e Tyr Ser Ala lie Gly lie Glu Thr Glu Asn Leu Asp 
1205 1210 1215 

Phe Ser Lys Lys lie Met Met Leu Pro Asn Ala Pro Ser Arg Val Phe 
1220 1225 1230 

Trp Trp Glu Thr Gly Ala Val Pro Gly Leu Arg Ser Leu Glu Asn Asp 
iiJ = 1240 12<1S 

aiy Thr Arg Leu Leu Asp Ser lie Arg Asp Leu Tyr Pro Civ Lys Phe 

15 1250 1255 1250 ' 7 

Tyr Trp Arg Phe Tyr Ala phe phe ftsp Tyj . ^ ^ ^ ^ ^ 

1270 1275 ^ 8() 

Pro Val Tyr Glu Asp Thr Asn He Lys lie Lys Leu Asp Lys Asp Thr 
1285 12 90 12 g 5 

Arg Asn Phe lie Met Pro Thr lie Thr Thr Asn Glu He Arq Ann Lys 
1300 1305 - -" - r 



25 



35 



40 



50 



55 



1310 



Leu ser Tyr Ser Phe Asp Gly Ala Gly Gly Thr Tyr Ser Leu Leu Leu 
1315 1320 1325 

Ser Ser T yr Pro He Ser Thr Asn lie Asn Leu Ser Lys Asp Asu Leu 
;,() 1335 1340 

T^c 11 "-' Phe Asn Iie A fP Asn Glu Val Arg Glu lie Ser He Glu Asn 

1350 1355 136Q 

Gly Thr lie Lys Lys Gly Lys Leu lie Lys Asp Val Leu Ser Lys He 
1365 i37 0 13?5 

Asp He Asn Lys Asn Lys Leu He He Gly Asn Gin Thr He Asp Phe 
1380 i3 8 5 13go 

Ser CJy Asp lie Asp Asn Lys Asp Arg Tyr lie Phe Leu Thr Cys Glu 
139= i4 00 1<J05 

Leu Asp Asp Lys lie Ser Leu He He Glu He Asn Leu Val Ala Lys 
45 1410 HIS 1420 

Ser Tyr Ser Leu Leu Leu Ser Gly Asp Lys Asn Tyr Leu lie Scr Asn 
5 1430 ^35 1440 



Leu Ser Asn Thr He Glu Lys He Asn Thr Leu Gly Leu Asp Ser Lys 
1445 1450 1455 

Asn He Ala Tyr Asn Tyr Thr Asp Glu Ser Asn Asn Lys Tvr Phe Glv 
1460 i 46 5 147Q 

Ala He Ser Lys Thr Ser Gin Lys Ser lie lie His Tyr Lys Lys Asp 
1475 1480 i 48 5 

Hon* 5 " 116 LeU Glu Phe Tyr Asn As P Ser Thr L eu Glu Phe Asn 
C() 1490 1495 1500 

lfos LyS A$iP ^ 116 G1U ASP IU ASn Val Phe Met L V S Asp Asp 

15 . 10 1515 1520 

He Asr. Thr He Thr Gly Lys Tyr Tyr Val Asp Asn Asn Thr Asp Lys 
1525 1530 i 5 5 5 

ser lie Asp Phe Ser He Ser Leu Val Ser Lys Asn Gin Val Lvs Val 
1540 1545 is5o " 
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Asn Gly Leu Tyr Leu Asn Glu Ser Val Tyr Ser Ser Tyr Leu Asp Phe 
1555 1S60 !5 65 ^ 

Val Lys Asn Ser Asp Gly His His Asn Thr Ser Asn Phe Met Asn Leu 
1570 .1575 158 o 

Phe Leu Asp Asn lie Ser Phe Trp Lys Leu Phe Gly Phe Glu Asn He 
1585 1590 1595 16 oo 

Asn Phe Val He Asp Lys Tyr Phe Thr Leu Val Gly Lys Thr Asn Leu 
1 6 °5 1610 1615 

Gly Tyr Val Glu Phe lie Cys Asp Asn Asn Lys Asn lie Asp lie Tvr 
|S 1620 1625 1630 

Phe Gly Glu Trp Lys Thr Ser Ser Ser Lys Ser Thr He Phe Ser Gly 
1635 1640 X645 

^0 ?^n Ar9 Val Val G1U Pr ° Ue Tyr Asn Pr ° Asp Thr Gly 

ibi0 1655 1.66O 

?665 ASP fl7 0 LeU ^ G1U Pr ° LSU Tyi ' Gly 

2> lie Asp Arg Tyr He Asn Lys Val Leu He Ala Pro Asp Leu Tvr Thr 

1685 1690 1695 

Ser Leu lie Asn He Asn Thr Asn Tyr Tyr Ser Asn Glu Tvr Tvr P-o 
3() 1700 1 7 °5 1710 ' 

Glu lie He Val Leu Asn Pro Asn Thr Phe His Lys Lys Val Asn He 
1715 1720 i 725 

IS AS " V^ n ASP SCr Ser Phe G1U Tyr Lys Tr P Ser Thr Glu Gly Ser 

JU 1735 J74Q 

Asp Phe He Leu Val Arg Tyr Leu Glu Glu Ser Asn Lvs Lys lie Leu 
45 1750 1755 ' 1760 

40 Gin Lys lie Arg He Lys Gly He Leu Ser Asn Thr Gin Ser Phe Asn 

I 765 1770 1775 

Lys Met Ser He Asp Phe Lys Asp He Lys Lys Leu Ser Leu Gly Tvr 
45 1780 1 7 85 1790 

lie Met Ser Asn Phe Lys Ser Phe Asn Ser Glu Asn Glu Leu Asp Arc 
1795 isoo i 805 

50 ASP ?«?o LeU Gly Phe LyS 116 Ue Asp Asn Lys Thl ' T >'r Tyr Tyr Asp 
1810 ibis 1820 ' V 

Glu Asp ser Lys Leu Val Lys Gly Leu He Asn He Asn Asn Ser Leu 
3 1830 1835 1840 

35 Phe Tyr Phe Asp Pro He Glu Phe Asn Leu Val Thr Gly Trp Gin Thr 

1845 1850 1855 

He Asn Gly Lys Lys Tyr Tyr Phe Asp He Asn Thr Gly Ala Ala Leu 
60 1860 186 5 1870 

Thr Ser Tyr Lys He He Asn Gly Lys His Phe Tyr Phe Asn Asn Asp 
1875 18 80 1B 85 

65 yag n M6t Gln LeU Gly Va 0 l Phe LyS Gly Pro AS P G1 y p he Glu Tvr 

* u 1895 1900 

Phe Ala Pro Ala Asn Thr Gin Asn Asn Asn He Glu Gly Gin Ala lie 

191° 1915 1920 
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Val Tyr Gin Ser Lys Phe Leu Thr Leu Asn Glv Lys Lys Tyr Tyr Phe 
1925 1930 * 1935 

Asp Asn Asn Ser Lys Ala Val Thr Gly Trp Arg lie He Asn Asn Glu 
1940 1945 1950 

Lys Tyr Tyr Phe Asn Pro Asn Asn Ala He Ala Ala Val Gly Leu Gin 
1955 i960 1965 • 

Val He Asp Asn Asn Lys Tyr Tyr Phe Asn Pro Asp Thr Ala He He 
197 ° 1975 1980 

Ser Lys Gly Trp Gin Thr Val Asn Gly Ser Arg Tyr Tyr Phe Asp Thr 
1985 1990 1995 2 000 

Asp Thr Ala He Ala Phe Asn Gly Tyr Lys Thr lie Asp Gly Lys His 
2005 2010 2015 

Phe Tyr Phe Asp Ser Asp Cys Val Val Lys He Gly Val Phe Ser Thr 
" U 2020 2025 2030 

Ser Asn Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Tyr Asn Asn Asn 
2035 2040 2045 

2? lie Glu Gly Gin Ala lie Val Tyr Gin Ser Lys Phe Leu Thr Leu Asn 

20S0 2055 2060 

Gly Lys Lys Tyr Tyr Phe Asp Asn Asn Ser Lys Ala Val Thr Gly Leu 
M) * 065 2070 2075 2080 

Gin Thr He Asp Ser Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Glu 
2085 2090 2095 

Ala Ala Thr Gly Trp Gin Thr He Asp Gly Lys Lvs Tyr Tyr Phe Asn 
- V 2100 2105 ' 2110 

Thr Asn Thr Ala Glu Ala Ala Thr Gly Trp Gin Thr lie Asp Gly Lvs 
2115 2120 2125 

40 Lys Tyr Tyr Phe Asn Thr Asn Thr Ala He Ala Ser Thr Gly Tyr Thr 

2130 2135 2140 

He He Asn Gly Lys His Phe Tyr Phe Asn Thr Asp Gly He Met Gin 
4S 21AS 21 50 2 i55 2160 

He Gly Val Phe Lys Gly Pro Asn Gly Phe Glu Tyr Phe Ala Pro A] a 
2165 2170 2175 

Asn Thr Asp Ala Asn Asn Tie Glu Gly Gin Ala He Leu Tvr Gin Asn 
2 1 80 2185 2190 

Glu Phe Leu Thr Leu Asn Gly Lvs Lys Tyr Tyr Phe Gly Ser Asp Ser 
2195 2200 2205 

.V Lys Ala Val Thr Gly Trp Arg He He Asn Asn Lvs Lys Tvr Tyr Phe 

2210 2215 2220 

Asn Pro Asn Asn Ala lie Ala Ala He His Leu Cys Thr lie Asn Asn 
222b 2230 2235 2240 

Asp Lys Tyr Tyr Phe Ser Tyr Asp Gly He Leu Gin Asn Gly Tvr lie 
2245 2250 2255 

Thr He Glu Arg Asn Asn Phe Tyr Phe Asp Ala Asn Asn Glu S*r Lys 
2260 2265 2270 

Met Val Thr Gly Val Phe Lys Gly Pro Asn Glv Phe Glu Tyr Phe Ala 
2275 2280 ' 2285 
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Pro Ala Asn Thr His Asn Asn Asn lie Glu Gly Gin Ala lie Val Tvr 
2290 2295 2300 

Gin Asn Lys Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe Asp Asn 
2305 2310 2315 2320 

Asp Ser Lys Ala Val Thr Gly Trp Gin Thr He Asp Gly Lys Lys Tyr 
2325 2330 2 335 

Tyr Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr Gly Trp Gin Thr He 
2340 2345 2350 

Asp Gly Lys Lys Tyr Tyr Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr 
2355 2360 2 365 

Gly T^n Gln Thr Ue ASP Gly LyS Lys Tyr T ^ r Phe As " Thr Asn Thr 
2370 2375 2380 

Phe lie Ala Ser Thr Gly Tyr Thr Ser lie Asn Gly Lvs His Phe Tyr 
2385 2390 2395 ' 2400 

Phe Asn Thr Asp Gly He Met Gin lie Gly Val Phe Lvs Gly Pro Asn 
2405 2410 " 2415 

Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn lie Glu 
2420 2425 2430 

Gly Gin Ala lie Leu Tyr Gin Asn Lys Phe Leu Thr Leu Asn Gly Lvs 
24 35 2440 2445 

Lys Tyr Tyr Phe Gly Ser Asp Ser Lys Ala Val Thr Gly Leu Arg Trr 
2450 2455 2460 

Ue Asp Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Val Ala Val 
2465 2470 2475 2480 

Thr Gly Trp Gin Thr He Asn Gly Lys Lys Tyr Tyr Phe Asn Thr Asn 
2485 2490 2495 

Thr Ser lie Ala Ser Thr Gly Tyr Thr He He Ser Gly Lys His Phe 
2500 2505 2510 

Tyr Phe Asn Thr Asp Gly He Met Gin He Gly Val Phe Lys Glv P-o 
2515 2520 2525 

AGP SlL ?he GiU Tyi " Phe Ala Pro Aia Asn Thr As P Ala Asn Asn lie 
^ bJ0 2535 2540 

Glu Gly Gin Ala He Arg Tyr Gin Asn Arg Phe Leu Tyr Leu His Asp 
*" 545 2550 2555 2560 

Asn He Tyr Tyr Phe Gly Asn Asn Ser Lys Ala Ala Thr Gly Trp Val 
2 565 2570 2575 

Thr lie Asp Gly Asn Arg Tyr Tyr Phe Glu Pro Asn Thr Ala Met Glv 
2580 25B5 2590 

Ala Asn Gly Tyr Lys Thr He Asp Asn Lys Asn Phe Tyr Phe Arg Asn 
2595 2600 2605 

° iy ™ Pl '° GlU IiS Gly Val Phe Lys Ser Asn Gly Phe Glu Tyr 

2610 2615 2620 

Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn He Glu Gly Gin Ala lie 
2625 2630 2635 2640 

Arg Tyr Gin Asn Arg Phe Leu His Leu Leu Gly Lys lie Tvr Tyr Phe 
264 5 2650 ' 2655 
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Gly Asn Asn Ser Lys Ala Val Thr Gly Trp Gin Thr He Asn Gly Lys 
2660 2665 2670 

Val Tyr Tyr Phe Met Pro Asp Thr Ala Met Ala Ala Ala Gly Gly Leu 
2675 26B0 2685 

Phe ^L Ue ASP Gly Val Ile Tyr Phe Phe G1 V v al Asp Glv Val Lys 
2690 2695 2 700 

Ala Pro Gly He Tyr Gly 
2705 2 7io 

i:M INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS : unknown 
(D> TOPOLOGY: unknown 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Ser Tyr Lys Ile Ile Asn Gly Lys His Phe Tyr Phe Asn Asn Asp Gly 
1 5 10 ■ 15 

Val Met Gin Leu Gly Val Phe Lys Gly Pro Asp Gly Phe Glu Tyr Phe 
20 25 30 

Ala Pro Ala Asn Thr Gin Asn Asn Asn Ile Glu Gly Gin Ala He Val 
3 5 4 0 45 

Tyr Gin Ser Lys Phe Leu Thr Leu Asn Gly Lvs Lys Tyr Tvr Phe Asp 
50 55 60 * 

Asn Asn Ser Lys Ala Val Thr Gly Trp Arg Ile lie Asn Asn Glu Lys 
65 70 75 8 o 

Tyr Tyr Phe Asn Pro Asn Asn Ala Ile Ala Ala Val Gly Leu Gin Val 
85 go 95 

lie Asp Asn Asn Lys Tyr Tyr Phe Asn Pro Asp Thr Ala lie Ile Ser 
100 105 110 

Lys Gly Trp Gin Thr Val Asn Gly Ser Arg Tyr Tyr Phe Asp Thr Asp 
Hj 120 125 

Thr Ala Ile Ala Phe Asn Gly Tyr Lys Thr Ile Asp Gly Lys His Phe 
130 135 140 

Tyr Phe Asp Ser Asp Cys Val Val Lys Ile Gly Val Phe Ser Thr Ser 
145 150 155 i 6 o 

Asn Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Tyr Asn Asn Asn Ile 
165 170 175 

Glu Gly Gin Ala Ile Val Tyr Gin Ser Lys Phe Leu Thr Leu Asn Gly 
180 185 iyQ 

Lys Lys Tyr Tyr Phe Asp Asn Asn Ser Lys Ala Val Thr Gly Leu Gin 
195 200 205 

Thr lie Asp Ser Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Glu Ala 
210 215 220 

Ala Thr Gly Trp Gin Thr Ile Asp Gly Lys Lys Tyr Tyr Phe Asn Thr 
225 230 235 * 240 
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Asn Thr Ala Glu Ala Ala Thr Gly Trp Gin Thr He Asp Gly Lys Lys 
245 250 255 

Tyr Tyr Phe Asn Thr Asn Thr Ala He Ala Ser Thr Gly Tyr Thr He 
260 265 270 

lie Asn Gly Lys His Phe Tyr Phe Asn Thr Asp Gly He Met Gin He 
2*75 280 285 

Gly Val Phe Lys Gly Pro Asn Gly Phe Glu Tyr Phe Ala Pro Ala Asn 
290 295 300 

Thr Asp Ala Asn Asn He Glu Gly Gin Ala He Leu Tyr Gin Asn Glu 

,5 305 310 315 320 

Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe Gly Ser Asp Ser Lys 
325 330 335 



10 
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Ala val Thr Gly Trp Arg He He Asn Asn Lys Lys Tvr Tyr Phe Asn 

340 345 ' 350 

Pro Asn Asn Ala He Ala Ala lie His Leu Cys Thr He Asn Asn Asp 

355 360 365 

Lys Tyr Tyr Phe Ser Tyr Asp Gly lie Leu Gin Asn Gly Tyr He Thr 

370 375 3Q0 

lie Glu Arg Asn Asn Phe Tyr Phe Asp Ala Asn Asn Glu Ser Lys Met 

385 390 395 400 

Val Thr Giy Val Phe Lys Gly Pro Asn Gly Phe Glu Tyr Phe Ala Pro 

405 410 415 

Aia Asn T nr His Asn Asn Asn lie Glu Gly Gin Ala He Val Tvr Gin 

-° 420 425 ,,30 * 



30 



40 



50 



Asn Lys Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe Aso Asn Asp 
435 440 445 

Ser Lys Ala Val Thr Gly Trp Gin Thr He Asp Gly Lys Lvs Tvr Tyr 
450 455 460 



Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr Gly Trp Gin Thr He Asp 
45 465 470 475 480 

Giy Lys Lys Tyr Tyr Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr Gly 
485 490 495 



Trp Gin Thr lie Asp Gly Lys Lys Tyr Tvr Phe Asn Thr Asn Thr Phe 

500 505 510 

He Ala Ser Thr Gly Tyr Thr Ser lie Asn Gly Lys His Phe Tvr Phe 

515 520 525 

Asn Thr Asp Gly He Met Gin He Gly Val Phe Lys Gly Pro Asn Glv 

530 535 540 



Phe Glu Tyr Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn lie Glu Glv 
6f) 545 550 555 560 

Gin Ala lie Leu Tyr Gin Asn Lys Phe Leu Thr Leu Asn Gly Lys Lys 
565 570 575 

T Y r T Y r phe Gly Ser Asp Ser Lys Ala Val Thr Gly Leu Arg Thr He 
580 sss S9 l 

Asp Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Val Ala Val Thr 
595 600 605 
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Gly Trp Gin Thr lie Asn Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr 
610 615 6 20 

Ser lie Ala Ser Thr Gly Tyr Thr lie lie Ser Gly Lys His Phe Tyr 
b " 5 630 €35 6 4o 

Phe Asn Thr Asp Gly He Met Gin lie Glv Val Phe Lys Glv Pro Asn 
645 6 50 * 655 

Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn lie Glu 
660 665 670 

Gly Gin Ala lie Arg Tyr Gin Asn Arg Phe Leu Tyr Leu His Asp Asn 
65 680 685 

He Tyr Tyr Phe Gly Asn Asn Ser Lys Ala Ala Thr Gly Trp Val Thr 



700 



lie Asp Gly Asn Arg Tyr Tyr Phe Glu Pro Asn Thr Ala Met Gly Ala 
b 710 715 720 

Asn Gly Tyr Lys Thr He Asp Asn Lys Asn Phe Tyr Phe Arg Asn Gly 
72 $ 730 735 

Leu Pro Gin He Gly Val Phe Lys Gly Ser Asn Gly Phe Glu T-r Phe 
740 745 750 ' 

Ala Pro Ala Asn Thr Asp Ala Asn Asn He Glu Gly Gin Ala He Ara 
755 760 765 

Tyr Gin Asn Arg Phe Leu His Leu Leu Gly Lys He Tyr Tyr Phe Gly 
770 7 ?5 7B0 

Asn Asn Ser Lys Ala Val Thr Gly Trp Gin Thr lie Asn Gly Lvs Val 
785 790 7 95 ' BOO 

Tyr Tyr Phe Met Pro Asp Thr Ala Met Ala Ala 
805 810 

-) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : unknown 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION : SEQ ID NO : 8 : 

Ser Tyr Lys lie lie Asn Gly Lys His Phe Tyr Phe Asn Asn Asp Gly 
1 5 10 15 

Val Met Gin Leu Gly Val Phe Lys Gly Pro Asp Glv Phe Glu Tyr Phe 
20 25 30 

Ala Pro Ala Asn Thr Gin Asn Asn Asn lie Glu Gly Gin Ala lie Val 
35 40 ' 45 

Tyr "Gin Ser Lys Phe Leu Thr Leu Asn Gly Lys Lvs Tyr Tyr Phe Asp 
50 55 60 

Asn Asn Ser Lys Ala Val Thr Gly Trp Arg He He Asn Asn Glu Lvs 

65 70 -7C: y 

Tyr Tyr Phe Asn Pro Asn Asn Ala He Ala Ala 
85 90 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 7101 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

fix} FEATURE: 

(A) NAME /KEY : CDS 

(B> LOCATION: 1 . . 7098 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

ATG ACT TTA GTT AAT AGA AAA CAG TTA GAA AAA ATG GCA AAT GTA AGA 
Met Ser Leu Val Asn Arg Lys Gin Leu Glu Lys Met Ala Asn Val Arc 
1 5 10 is 

IS Arl Thr ^ TAT GTT GCA ATA TTG GAT GCT "A GAA 

Phe Arg Thr Gin Glu Asp Glu Tyr Val Ala lie Leu Asp Ala Leu Glu 

-° 25 3 0 

GAA TAT CAT AAT ATG TCA GAG AAT ACT GTA GTC GAA AAA TAT TTA AAA 
Glu Tyr His Asn Met Ser Glu Asn Thr Val Val Glu Lys Tyr Leu J£ 



40 



45 



lit f£ tT A ^ AGT TTA ACA GAT ATT TAT ATA GAT ACA TAT AAA 

Leu Lys Asp lie Asn Ser Leu Thr Asp lie Tyr lie Asp Thr Tyr Lys 

55 60 

AAA TCT GGT AGA AAT AAA GCC TTA AAA AAA TTT AAG GAA TAT CTA GTT 
Lys Ser Gly Arg Asn Lys Ala Leu Lys Lys Phe Lys Glu Tyr Leu Val 

70 75 80 

Thr G^ vll 7T A rt G f™ ™* MT ^ T AAT TTA ACT CCA GTT GAG AAA 
Thr Glu Val Leu Glu Leu Lys Asn Asn Asn Leu Thr Pro Val Glu Lys 

85 90 95 

AAT TTA CAT TTT GTT TGG ATT GGA GGT CAA ATA AAT GAC ACT GCT ATT 
Asn Leu His Phe Val Trp He Gly Gly Gin lie Asn Asp Thr Ala III 
100 105 no 

AAT TAT ATA AAT CAA TGG AAA GAT GTA AAT AGT GAT TAT AAT GTT AAT 
Asn Tyr lie Asn Gin Trp Lys Asp Val Asn Ser Asp Tyr Asn vll £1 
115 120 125 

vIT Ihl T AT o GT AAT GCA TTT TTG ATA AAC ACA ™3 AAA AAA ACT 

val Phe lyr Asp Ser Asn Ala Phe Leu He Asn Thr Leu Lys Lys Thr 
1JU 13 5 140 

Sal Sit G^ If r A?" tT A f T GAT ACA CTT GAA TCA TTT AGA GAA AAC 
Val Val Glu Ser Ala lie Asn Asp Thr Leu Glu Ser Phe Arg Glu Asn 

150 155 160 

TTA AAT GAC CCT AGA TTT GAC TAT AAT AAA TTC TTC AGA AAA CGT ATG 
Leu Asn Asp Pro Arg Phe Asp Tyr Asn Lys Phe Phe Arg Lys Arg Met 

170 175 

tT* tT T £ AT GAT CAG AAT TTC ATA AAC TAC TAT AAA GCT 

Glu lie He Tyr Asp Lys Gin Lys Asn Phe He Asn Tyr Tyr Lys Ala 
180 185 190 

£t* AGA St* St* AAT CCT GAA CTT ATA ATT GAT GAT ATT GTA AAG ACA 
Gin Arg Glu Glu Asn Pro Glu Leu lie lie Asp Asp He Val Lys t£t 
195 200 205 
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TAT CTT TCA AAT GAG TAT TCA AAG GAG ATA GAT GAA CTT AAT ACC TAT 
Tyr Leu Ser Asn Glu Tyr Ser Lys Glu He Asp Glu Leu Asn Thr Tyr 
0 215 220 

tV ^ S** l CC TTA ATT ACA CAG ^ AGT GGA AAT GAT GTT 

lie Glu Glu Ser Leu Asn Lys He Thr Gin Asn Ser Gly Asn Asp Val 

- 25 230 235 * 240 

AGA AAC TTT GAA GAA TTT AAA AAT GGA GAG TCA TTC AAC TTA TAT GAA 
Arg Asn Phe Glu Glu Phe Lys Asn Gly Glu Ser Phe Asn Leu Tyr Glu 
245 250 255 

CAA GAG TTG GTA GAA AGG TGG AAT TTA GCT GCT GCT TCT GAC ATA TTA 
Gin Glu Leu Val Glu Arg Trp Asn Leu Ala Ala Ala Ser Asp lie Leu 
260 265 2? 5 

AGA ATA TCT GCA TTA AAA GAA ATT GGT GGT ATG TAT TTA GAT GTT GAT 
Arg lie Ser Ala Leu Lys Glu He Gly Gly Met Tyr Leu Asp Val Asp 
275 280 285 

ATG TTA CCA GGA ATA CAA CCA GAC TTA TTT GAG TCT ATA GAG AAA CCT 
Met Leu Pro Gly lie Gin Pro Asp Leu Phe Glu Ser He Glu Lys Pro 
290 295 300 

AGT TCA GTA ACA GTG GAT TTT TGG GAA ATG ACA AAG TTA GAA GCT ATA 
Ser Ser Val Thr Val Asp Phe Trp Glu Met Thr Lvs Leu Glu Ala lie 
0j 310 315 ' 320 

ATG AAA TAC AAA GAA TAT ATA CCA GAA TAT ACC TCA GAA CAT TTT GAC 
Met Lys Tyr Lys Glu Tyr lie Pro Glu Tyr Thr Ser Glu His Phe Asp 
325 330 335 

ATG TTA GAC GAA GAA GTT CAA AGT AGT TTT GAA TCT GTT CTA GCT TCT 
Met Leu Asp Glu Glu Val Gin Ser Ser Phe Glu Ser Val Leu Ala Ser 
340 345 3 5 o 

AAG TCA GAT AAA TCA GAA ATA TTC TCA TCA CTT GGT GAT ATG GAG GCA 
Lys Ser Asp Lys Ser Glu lie Phe Ser Ser Leu Gly Asp Met Glu Ala 
355 360 365 

TCA CCA CTA GAA GTT AAA ATT GCA TTT AAT AGT AAG GGT ATT ATA AAT 
Ser Pro Leu Glu Val Lys He Ala Phe Asn Ser Lys Gly He He Asn 
37 0 375 3eo 

CAA GGG CTA ATT TCT GTG AAA GAC TCA TAT TGT AGC AAT TTA ATA GTA 
Gin Gly Leu lie Ser Val Lys Asp Ser Tyr Cys Ser Asn Leu He Val 
385 390 395 40 0 

AAA CAA ATC GAG AAT AGA TAT AAA ATA TTG AAT AAT AGT TTA AAT CCA 
Lys Gin He Glu Asn Arg Tyr Lys He Leu Asn Asn Ser Leu Asn Pro 
405 410 4i5 

GCT ATT AGC GAG GAT AAT GAT TTT AAT ACT ACA ACG AAT ACC TTT ATT 
Ala He Ser Glu Asp Asn Asp Phe Asn Thr Thr Thr Asn Thr Phe He 
42 0 425 430 

GAT AGT ATA ATG GCT GAA GCT AAT GCA GAT AAT GGT AGA TTT ATG ATG 
Asp ser He Met Ala Glu Ala Asn Ala Asp Asn Glv Arg Phe Met Met 
435 440 ' 445 

GAA CTA GGA AAG TAT TTA AGA GTT GGT TTC TTC CCA GAT GTT AAA ACT 
Glu Leu Gly Lys Tyr Leu Arg Val Gly Phe Phe Pro Asp Val Lys Thr 
450 455 460 

ACT ATT AAC TTA AGT GGC CCT GAA GCA TAT GCG GCA GCT TAT CAA GAT 
Thr lie Asn Leu Ser Gly Pro Glu Ala Tyr Ala Ala Ala Tyr Gin Asd 
465 470 475 480 
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10 



TTA TTA ATG TTT AAA GAA .GGC AGT ATG AAT ATC CAT TTG ATA GAA GCT 14 8 8 

Leu Leu Met Phe Lys Glu Gly Ser Met Asn lie His Leu lie Glu Ala 
485 490 495 

GAT TTA AGA AAC TTT GAA ATC TCT AAA ACT AAT ATT TCT CAA TCA ACT 1536 
Asp Leu Arg Asn Phe Glu He Ser Lys Thr Asn He Ser Gin Ser Thr 
500 505 510 

GAA CAA GAA ATG GCT AGC TTA TGG TCA TTT GAC GAT GCA AGA GCT AAA 1584 
Glu Gin Glu Met Ala Ser Leu Trp Ser Phe Asp Asp Ala Arg Ala Lys 
515 520 525 

GCT CAA TTT GAA GAA TAT AAA AGG AAT TAT TTT GAA GGT TCT CTT GGT 16 32 

Ala Gin Phe Glu Glu Tyr Lys Arg Asn Tyr Phe Glu Gly Ser Leu Glv 
I:n 530 535 540 

GAA GAT GAT AAT CTT GAT TTT TCT CAA AAT ATA GTA GTT GAC AAG GAG 
Glu Asp Asp Asn Leu Asp Phe Ser Gin Asn He Val Val Asp Lys Glu 
20 545 550 555 560 

TAT CTT TTA GAA AAA ATA TCT TCA TTA GCA AGA AGT TCA GAG AGA GGA 
Tyr Leu Leu Glu Lys He Ser Ser Leu Ala Arg Ser Ser Glu Arq Glv 
565 570 " 575 

25 TAT ATA CAC TAT ATT GTT CAG TTA CAA GGA GAT AAA ATT AGT TAT GAA 

Tyr lie His Tyr He Val Gin Leu Gin Gly Asp Lys He Ser Tyr Glu 
580 585 590 

GCA GCA TGT AAC TTA TTT GCA AAG ACT CCT TAT GAT AGT GTA CTG TTT 18 24 

Mi Ala Ala Cys Asn Leu Phe Ala Lys Thr Pro Tyr Asp Ser Val Leu Phe 
595 600 605 



40 



60 



65 



CAG AAA AAT ATA GAA GAT TCA GAA ATT GCA TAT TAT TAT AAT CCT GGA 
Gin Lys Asn He Glu Asp Ser Glu He Ala Tyr Tyr Tyr Asn Pro Gly 
-O fiio 615 



620 



GAT GGT GAA ATA CAA GAA ATA GAC AAG TAT AAA ATT CCA AGT ATA ATT 
Asp Gly Glu He Gin Glu He Asp Lys Tyr Lys He Pro Ser He lie 
625 630 635 



640 



700 



ATC AAC GTA GAG GAG ACT TAT CCT GGA AAA TTA TTA CTT AAA GTT AAA 
He Asn Val Glu Glu Thr Tyr Pro Gly Lys Leu Leu Leu Lvs Val Lvs 
705 710 * 



715 720 



1680 



1728 



1776 



1872 



1920 



TCT GAT AGA CCT AAG ATT AAA TTA ACA TTT ATT GGT CAT GGT AAA GAT 196 8 

Ser Asp Arg Pro Lys He Lys Leu Thr Phe He Gly His Gly Lys Asp 
645 650 655 

45 GAA TTT AAT ACT GAT ATA TTT GCA GGT TTT GAT GTA GAT TCA TTA TCC -016 

Glu Phe Asn Thr Asp He Phe Ala Gly Phe Asp Val Asp Ser Leu Ser 
660 665 670 

ACA GAA ATA GAA GCA GCA ATA GAT TTA GCT AAA GAG GAT ATT TCT CCT 2 064 

Thr Glu He Glu Ala Ala He Asp Leu Ala Lys Glu Asp He Ser Pro 
675 680 685 

AAG TCA ATA GAA ATA AAT TTA TTA GGA TGT AAT ATG TTT AGC TAC TCT 
Lys Ser He Glu He Asn Leu Leu Gly Cys Asn Met Phe Ser Tyr Ser 
^ * 690 695 - — 



2ii; 



2160 



GAT AAA ATA TCA GAA TTA ATG CCA TCT ATA AGT CAA GAC TCT ATT ATA 7*>08 

Asp Lys He Ser Glu Leu Met Pro Ser He Ser Gin Asp Ser He He 
725 730 735 

GTA AGT GCA AAT CAA TAT GAA GTT AGA ATA AAT AGT GAA GGA AGA AGA 2^56 

Val Ser Ala Asn Gin Tyr Glu Val Arg He Asn Ser Glu Gly Arq Arq 
74 ° 745 750 
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G^ 7L» 7™ ^ AT 5 AT TCT GGT GAA TGG ATA AAA GAA GAA ACT ATT 

Glu Leu Leu Asp His Ser Gly Glu Trp lie Asn Lys Glu Glu Ser lie 
755 760 7 65 

ATA AAG GAT ATT TCA TCA AAA GAA TAT ATA TCA TTT AAT CCT AAA GAA 
He Lys Asp lie Ser Ser Lys Glu Tyr He Ser Phe Asn Pro Lys 

AAT AAA ATT ACA GTA AAA TCT AAA AAT TTA CCT GAG CTA TCT ACA TTA 
Asn Lys lie Thr Val Lys Ser Lys Asn Leu Pro Glu Leu Ser Thr llu 
85 790 795 goo 

lit rt* ^ tT T AGA ^ TCT AAT TCA AGT GAT ATT GAA CTA GAA 

Leu Gin Glu He Arg Asn Asn Ser Asn Ser Ser Asp He Glu Leu Glu 
805 BIO 81S 

GAA AAA GTA ATG TTA ACA GAA TGT GAG ATA AAT GTT ATT TCA AAT ATA 
Glu Lys Val Met Leu Thr Glu Cys Glu He Asn Val He Ser JJJ n 
820 825 830 

GAT ACG CAA ATT GTT GAG GAA AGG ATT GAA GAA GCT AAG AAT TTA afT 
Asp Thr Gin He Val Glu Glu Arg He Glu Glu All ™ Tn7 

835 840 845 

TCT GAC TCT ATT AAT TAT ATA AAA GAT GAA TTT AAA CTA ATA GAA TCT 
Asp ser lie Asn Tyr He Lys Asp Glu Phe Lys Leu He Glu" III 

855 ego 

ill III Aso A1 A f TA r CT f ° TTA ^ CAA CAG AAT GA A "A GAA GAT 
He Ser Asp Ala Leu Cys Asp Leu Lys Gin Gin Asn Glu Leu Glu Asp 

875 9QQ 

III H A I pT tT A I CT ^ GAG GAC ATA TCA GAG ACT GAT GAG GGA TTT 
Ser Hxs Phe He Ser Phe Glu Asp He Ser Glu Thr Asp Glu Glv Phe 

885 890 8 9 5 

AGT ATA AGA TTT ATT AAT AAA GAA ACT GGA GAA TCT ATA TTT GTA GAA 
Ser He Arg Phe He Asn Lys Glu Thr Gly Glu Ser lie III vll G^ 
900 90S 910 

ACT GAA AAA ACA ATA TTC TCT GAA TAT GCT AAT CAT ATA ACT GAA rir 
Thr Glu Lys Thr He Phe Ser Glu Tyr Ala Asn Hil He Thr Sft Glu 
315 920 925 

ATT TCT AAG ATA AAA GGT ACT ATA TTT GAT ACT GTA AAT GGT AAG TTA 
lie Ser Lys He Lys Gly Thr lie Phe Asp Thr Val Si cly leu 

JU 935 940 

GTA AAA AAA GTA AAT TTA GAT ACT ACA CAC GAA GTA AAT ACT TTA AAT 
Val Lys Lys Val Asn Leu Asp Thr Thr His Glu Val Asn Thr lIu 

950 955 960 

^ T ^ A ll T T l T ATA CAA TCA TTA ATA GAA TAT AAT AGT TCT AAA GAA 
Ala Ala Phe Phe lie Gin Ser Leu He Glu Tyr Asn Ser Ser Zvs Glu 
965 970 975 

TCT CTT AGT AAT TTA AGT GTA GCA ATG AAA GTC CAA GTT TAC GCT CAA 
Ser Leu Ser Asn Leu Ser Val Ala Met Lys Val Gin Val Tyr All SJ 
980 985 99Q 

TTA TTT AGT ACT GGT TTA AAT ACT ATT ACA GAT GCA GCC AAA GTT GTT 
Leu Phe Ser Thr Gly Leu Asn Thr lie Thr Asp Ala Ala vll vIT 

995 1000 1005 



2304 



2352 



2400 



2448 



2496 



2544 



2592 



2640 



2668 



2736 



2784 



2832 



2880 



2928 



2976 



3024 
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60 



GAA TTA GTA TCA ACT GCA TTA GAT GAA ACT ATA GAC TTA CTT CCT ACA 
Glu Leu Val Ser Thr Ala Leu Asp Glu Thr lie Asp Leu Leu Pro Thr 
1°10 1015 1020 

TTA TCT GAA GGA TTA CCT ATA ATT GCA ACT ATT ATA GAT GGT GTA AGT 
Leu Ser Glu Gly Leu Pro lie He Ala Thr He He Asp Gly Val Ser 
1025 1030 1035 1040 



AAA GAT TTA ATG GTA TTA CCT AAT GCT CCA AAT AGA GTA TTT GCT TGG 
Lys Asp Leu Met Val Leu Pro Asn Ala Pro Asn Arg Val Phe Ala Trp 
l22 0 1225 1230 

GAA ACA GGA TGG ACA CCA GGT TTA AGA AGC TTA GAA AAT GAT GGC ACA 
Glu Thr Gly Trp Thr Pro Gly Leu Arg Ser Leu Glu Asn Asp Gly Thr 
1235 1240 1245 

AAA CTG TTA GAC CGT ATA AGA GAT AAC TAT GAA GGT GAG TTT TAT TGG 
Lys Lou Leu Asp Arg He Arg Asp Asn Tyr Glu Gly Glu Phe Tvr Trp 
lfc50 1255 1260 

65 AGA TAT TTT GCT TTT ATA GCT GAT GCT TTA ATA ACA ACA TTA AAA CCA 

Arg Tyr Phe Ala Phe He Ala Asp Ala Leu lie Thr Thr Leu Lys Pro 
1265 1270 1275 i 280 



3072 



3120 



3216 



3264 



3312 



3360 



TTA GGT GCA GCA ATC AAA GAG CTA AGT GAA ACG AGT GAC CCA TTA TTA 316 8 

IU Leu Gly Ala Ala lie Lys Glu Leu Ser Glu Thr Ser Asp Pro Leu Leu 

1045 1050 1055 

AGA CAA GAA ATA GAA GCT AAG ATA GGT ATA ATG GCA GTA AAT TTA ACA 
Arg Gin Glu lie Glu Ala Lys He Gly lie Met Ala Val Asn Leu Thr 
]? 1060 1065 1070 

ACA GCT ACA ACT GCA ATC ATT ACT TCA TCT TTG GGG ATA GCT AGT GGA 
Thr Ala Thr Thr Ala He He Thr Ser Ser Leu Gly He Ala Ser Gly 
-, 0 1075 1080 108S 

TTT AGT ATA CTT TTA GTT CCT TTA GCA GGA ATT TCA GCA GGT ATA CCA 
Phe Ser He Leu Leu Val Pro Leu Ala Gly lie Ser Ala Gly lie Pro 
1° 90 1095 iioo 

25 AGC TTA GTA AAC AAT GAA CTT GTA CTT CGA GAT AAG GCA ACA AAG GTT 

oer Leu Val Asn Asn Glu Leu Val Leu Arg Asp Lys Ala Thr Lys Val 

HIO 1H5 1120 

M) u T t ^ T l AT TTT ^ CAT GTT TCA TTA GTT GAA ACT GAA GGA GTA TTT 34 00 

M) Val Asp Tyr Phe Lys His Val Ser Leu Val Glu Thr Glu Gly Val Phe 

1125 i 130 1135 

ACT TTA TTA GAT GAT AAA ATA ATG ATG CCA CAA GAT GAT TTA GTG ATA 
Thr Leu Leu Asp Asp Lys lie Met Met Pro Gin Asp Asp Leu Val lie 
H40 H45 use 

TCA GAA ATA GAT TTT AAT AAT AAT TCA ATA GTT TTA GGT AAA TGT GAA 
Ser Glu lie Asp Phe Asn Asn Asn Ser lie Val Leu Gly Lys Cvs Glu 
II 55 H60 H65 

ATC TGG AGA ATG GAA GGT GGT TCA GGT CAT ACT GTA ACT GAT GAT ATA 
He Trp Arg Met Glu Gly Gly Ser Gly His Thr Val Thr Asp Asp He 
1170 1175 iiao 

^ GAT CAC TTC TTT TCA GCA CCA TCA ATA ACA TAT AGA GAG CCA CAC TTA 

Asp Has Phe Phe Ser Ala Pro Ser He Thr Tyr Arg Glu Pro His Leu 
1185 ll 90 1195 1200 

TCT ATA TAT GAC GTA TTG GAA GTA CAA AAA GAA GAA CTT GAT TTG TCA 3 648 

Ser Iie T Vr Asp Val Leu Glu Val Gin Lys Glu Glu Leu Asp Leu Ser 
1205 1210 1215 



3456 



3504 



3552 



3600 



3696 



3744 



3792 



3840 
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AGA TAT GAA GAT ACT AAT ATA AGA ATA AAT TTA GAT AGT AAT ACT AGA 
Arg Tyr Glu Asp Thr Asn He Arg He Asn Leu Asp Ser Asn Thr Arq 
1285 1290 1295 

AGT TTT ATA GTT CCA ATA ATA ACT ACA GAA TAT ATA AGA GAA AAA TTA 
Ser Phe lie Val Pro lie lie Thr Thr Glu Tyr lie Arg Glu Lys Leu 
1300 1305 * 13 io 



AAA ATC TCT CTT TCT TTG ACT CTA CAA GAT GAA AAA ACT ATA AAG TTA 
Lys lie Ser Leu Ser Leu Thr Leu Gin Asp Glu Lys Thr He Lys Leu 
1540 1545 1550 



3888 



3936 



4032 



4080 



4128 



4176 



1() I CA l AT l CT TTC TAT GGT TCA GGA GGA ACT TAT GCA TTG TCT CTT TCT 3 964 

H) Ser Tyr Ser Phe Tyr Gly Ser Gly Gly Thr Tyr Ala Leu Ser Leu Ser 

1315 1320 1325 

CAA TAT AAT ATG GGT ATA AAT ATA GAA TTA AGT GAA AGT GAT GTT TGG 
Gin Tyr Asn Met Gly He Asn He Glu Leu Ser Glu Ser Asp Val Tro 
1330 1335 i3 4 o 

ATT ATA GAT GTT GAT AAT GTT GTG AGA GAT GTA ACT ATA GAA TCT GAT 
lie He Asp Val Asp Asn Val Val Arg Asp Val Thr He Glu Ser Asd 

1 J4b 1350 lice: " 

2() 1355 1360 

AAA ATT AAA AAA GGT GAT TTA ATA GAA GGT ATT TTA TCT ACA CTA AGT 
Lys He Lys Lys Gly Asp Leu lie Glu Gly He Leu Ser Thr Leu Ser 
1365 1370 1375 

2> ATT GAA GAG AAT AAA ATT ATC TTA AAT AGC CAT GAG ATT AAT TTT TCT 

He Glu Glu Asn Lys He He Leu Asn Ser His Glu lie Asn Phe Ser 
1380 1385 139Q 

M) r? T w T ? G ? A AGT ^ GGA TTT GTT TCT TTA ACA TTT TCA ATT 4224 

->» Gly Glu Val Asn Gly Ser Asn Gly Phe Val Ser Leu Thr Phe Ser He 

1395 1400 1405 

TTA GAA GGA ATA AAT GCA ATT ATA GAA GTT GAT TTA TTA TCT AAA TCA 

?^n Giy Ile Asn Ala Ile Ue Glu Val As P Ser Lys ser 

1/110 1415 1420 

TAT AAA TTA CTT ATT TCT GGC GAA TTA AAA ATA TTG ATG TTA AAT T~A 
ryr Lys Leu Leu He Ser Gly Glu Leu Lys Ile Leu Met Leu Asn Ser 
4() 25 1430 1435 1440 

AAT CAT ATT CAA CAG AAA ATA GAT TAT ATA GGA TTC AAT AGC GAA TTA 
Asn His lie Gin Gin Lys Ile Asp Tyr Ile Gly Phe Asn Ser Glu Leu 
1445 1450 1455 

45 CAG AAA AAT ATA CCA TAT AGC TTT GTA GAT AGT GAA GGA AAA GAG AAT 

G.n Lys Asn lie Pro Tyr Ser Phe Val Asp Ser Glu Gly Lys Glu Asn 
1460 14S5 147Q 

GGT TTT ATT AAT GGT TCA ACA AAA GAA GGT TTA TTT GTA TCT GAA TTA 4464 
W Gly Phe He Asn Gly Ser Thr Lys Glu Gly Leu Phe Val Ser Glu Leu 

1475 1480 148S 

CCT GAT GTA GTT CTT ATA AGT AAG GTT TAT ATG GAT GAT AGT AAG CC 

Pr ° ??£ ft Val Val LeU Ue Ser Lys Val ^ Met As P As P Lys Pro 

1490 1495 isoo 

TCA TTT GGA TAT TAT AGT AAT AAT TTG AAA GAT GTC AAA GTT ATA ACT 
oer Phe Gly Tyr Tyr Ser Asn Asn Leu Lys Asp Val Lys Val Ile Thr 
6() 1505 1510 "IB 1520 

AAA GAT AAT GTT AAT ATA TTA ACA GGT TAT TAT CTT AAG GAT GAT ATA 
Lys Asp Asn Val Asn lie Leu Thr Gly Tyr Tyr Leu Lvr, Asp Asp lie 
1525 i53 0 ' p 15 ^ 5 



4272 



4320 



4368 



4416 



4512 



4560 



4608 



4656 
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AAT AGT GTG CAT TTA GAT GAA AGT GGA GTA GCT GAG ATT TTG AAG TTC 
Asn Ser Val His Leu Asp Glu Ser Gly Val Ala Glu He Leu Lys Phe 
1555 1560 1565 

ATG AAT AGA AAA GGT AAT ACA AAT ACT TCA GAT TCT TTA ATG AGC TTT 

Met Asn Arg Lys Gly Asn Thr Asn Thr Ser Asp Ser Leu Met Ser Phe 
1570 1575 158O 

TTA GAA AGT ATG AAT ATA AAA AGT ATT TTC GTT AAT TTC TTA CAA TCT 
Leu Glu Ser Met Asn lie Lys Ser lie Phe Val Asn Phe Leu Gin Ser 
1585 1590 1595 1600 

AAT ATT AAG TTT ATA TTA GAT GCT AAT TTT ATA ATA AGT GGT ACT ACT 
Asn He Lys Phe lie Leu Asp Ala Asn Phe He He Ser Gly Thr Thr 
1605 1610 i 615 

TCT ATT GGC CAA TTT GAG TTT ATT TGT GAT GAA AAT GAT AAT ATA CAA 
Ser He Gly Gin Phe Glu Phe He Cys Asp Glu Asn Asp Asn He Si 
1620 1625 1630 

Cca TAT TTC ATT AAG TTT AAT ACA CTA GAA ACT AAT TAT ACT TTA TAT 
Pro Tyr Phe He Lys Phe Asn Thr Leu Glu Thr Asn Tyr Thr Leu Tyr 
1635 1640 1645 y 

V^t r?f ^ AAT ATG ATA GTG GAA CCA AAT TAT ^T TTA GAT 

Val Cly^Asn Arg Gin Asn Mettle Val Glu Pro Asn^Tyr Asp Leu Asp 

GAT TCT GGA GAT ATA TCT TCA ACT GTT ATC AAT TTC TCT CAA AAG TAT 
Asp ser Gly Asp He Ser Ser Thr Val He Asn Phe Ser Gin" L^s TCr 
1665 1670 1675 * ileo 

CTT TAT GGA ATA GAC AGT TGT GTT AAT AAA GTT GTA ATT TCA CCA AAT 
Leu Tyr Gly He Asp Ser Cys Val Asn Lys Val Val He Ser Pro A^n 
1685 1690 16g5 

tTI -T AT GAT GAA ATA AAT ATA ACG CCT GTA TAT ACA AAT AAT 

He ryr Thr Asp Glu He Asn He Thr Pro Val Tyr Glu Thr Asn Asn 

1700 1705 1710 

Thr lt T o CA ?, T 1 ATT GTA TTA GAT GCA TAT ATA AAT GAA AAA 

Thr Tyr Pro Glu Val He Val Leu Asp Ala Asn Tyr He Asn Glu Lys 
1715 1720 1725 

tT A ^ ST AAT A T C *** ° AT CTA TCT ATA CGA TAT GTA TGG AGT AAT 
lie Asn val Asn He Asn Asp Leu Ser He Arg Tyr Val Trp Ser Asn 
1730 1735 1740 

GAT GGT AAT GAT TTT ATT CTT ATG TCA ACT AGT GAA GAA AAT AAG GTG 
Asp Gly Asn Asp Phe He Leu Met Ser Thr Ser Glu gIu Asn L^s vll 

1750 1755 1760 

ler rft SIT tT A AGA TTC GTT AAT GTT TTT ^ GAT AAG ACT TTG 

Ser Gin Val Lys lie Arg Phe Val Asn Val Phe Lys Asp Lys Thr Leu 

1765 1770 1775 

GCA AAT AAG CTA TCT TTT AAC TTT AGT GAT AAA CAA GAT GTA CCT GTA 
Ala Asn Lys Leu Ser Phe Asn Phe Ser Asp Lys Gin Asp Val Pro vll 
1780 1785 1790 

S G r rt^ tT A A T° T TA TCA TTT ACA CCT TCA TAT TAT GAG GAT GGA TTG 
Ser Glu lie He Leu Ser Phe Thr Pro Ser Tyr Tyr Glu Asp Glv Leu 
1795 1800 i 8 05 

ill r?° T AT GAT TTG GGT CTA GTT TCT TTA TAT AAT GAG AAA TTT TAT 
He Gly Tyr Asp Leu Gly Leu Val Ser Leu Tyr Asn Glu Lys Phe TyT 
1810 1815 1820 yl 



4704 



4752 



4800 



4848 



4896 



4944 



4992 



5040 



5088 



5136 



5184 



5232 



5280 



5328 



5376 



5424 



547: 
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ATT AAT AAC TTT GGA ATG ATG GTA TCT GGA TTA ATA TAT ATT AAT GAT 
lie Asn Asn Phe Gly Met Met Val Ser Gly Leu lie Tyr lie Asn Asp 
1825 lf >30 1835 i 8 40 

TCA TTA TAT TAT TTT AAA CCA CCA GTA AAT AAT TTG ATA ACT GGA TTT 
Ser Leu Tyr Tyr Phe Lys Pro Pro Val Asn Asn Leu He Thr Gly Phe 
1845 18S0 18 55 



AAA ATT TAC TAT TTT GAT GAT TCA TTT ACA GCT GTA GTT GGA TGG AAA 
Lys He Tyr Tyr Phe Asp Asp Ser Phe Thr Ala Val Val Gly Tro Lvs 
-065 2070 2075 



080 



6> GAT TTA GAG GAT GGT TCA AAG TAT TAT TTT GAT GAA GAT ACA GCA GAA 

Asp Leu Glu Asp Gly Ser Lys Tyr Tyr Phe Asp Glu Asp Thr Ala Glu 
2085 2090 2095 



5520 



5568 



!0 ?, ? £u GGC ° AT GAT *AA TAC TAC TTT AAT CCA ATT AAT GGT GGA 56 16 

10 Val Thr Val Gly Asp Asp Lys Tyr Tyr Phe Asn Pro lie Asn Gly Gly 

"60 1865 167 0 

GCT GCT TCA ATT GGA GAG ACA ATA ATT GAT GAC AAA AAT TAT TAT TTC 
Ala Ala Ser He Gly Glu Thr He lie Asp Asp Lys Asn Tyr Tvr Phe 
n 1875 1880 1885 

AAC CAA AGT GGA GTG TTA -CAA ACA GGT GTA TTT AGT ACA GAA GAT GGA 
Asn Gin Ser Gly Val Leu Gin Thr Gly Val Phe Ser Thr Glu Asd Glv 
i () 1890 1895 1900 

TTT AAA TAT TTT GCC CCA GCT AAT ACA CTT GAT GAA AAC CTA GAA GGA 
Phe Lys Tyr Phe Ala Pro Ala Asn Thr Leu Asp Glu Asn Leu Glu Gly 
1905 1910 1915 1920 

2D GAA GCA ATT GAT TTT ACT GGA AAA TTA ATT ATT GAC GAA AAT ATT TAT 

Glu Ala He Asp Phe Thr Gly Lys Leu lie He Asp Glu Asn He Tvr 
1925 1930 1935 * 

TAT TTT GAT GAT AAT TAT AGA GGA GCT GTA GAA TGG AAA GAA TTA GAT 5 8 56 

Iyr Phe Asp Asp Asn Tyr Arg Gly Ala Val Glu Trp Lys Glu Leu Asp 
1940 1945 1950 

GGT GAA ATG CAC TAT TTT AGC CCA GAA ACA GGT AAA GCT TTT AAA GGT 



5664 



5712 



5760 



5808 



5904 



5952 



6000 



6048 



Gly Glu Met His Tyr Phe Ser Pro Glu Thr Gly Lys Ala Phe Lys Glv 
- K 1955 i960 1965 

CTA AAT CAA ATA GGT GAT TAT AAA TAC TAT TTC AAT TCT GAT GGA GTT 
Leu Asn Gin He Gly Asp Tyr Lys Tyr Tyr Phe Asn Ser Asp Gly Val 
4() 1970 1975 1980 

ATG CAA AAA GGA TTT GTT AGT ATA AAT GAT AAT AAA CAC TAT TTT GAT 
Met Gin Lys Gly Phe Val Ser lie Asn Asp Asn Lys His Tvr Phe Asd 
1985 1990 1995 * 2000 

45 GAT TCT GGT GTT ATG AAA GTA GGT TAC ACT GAA ATA GAT GGC AAG CAT 

Asp Ser Gly Val Met Lys Val Gly Tyr Thr Glu He Asp Gly Lys His 
2005 2010 ' 2015 

TTC TAC TTT GCT GAA AAC GGA GAA ATG CAA ATA GGA GTA TTT AAT ACA 60 96 

Phe Tyr Phe Ala Glu Asn Gly Glu Met Gin He Gly Val Phe Asn Thr 
2020 2025 2030 

GAA GAT GGA TTT AAA TAT TTT GCT CAT CAT AAT GAA GAT TTA GGA AAT 614 4 

Glu Asp Gly Phe Lys Tyr Phe Ala His His Asn Glu Asp Leu Gly Asn 
- 3 2035 2040 2045 

GAA GAA GGT GAA GAA ATC TCA TAT TCT GGT ATA TTA AAT TTC AAT AAT 619'' 
Glu Glu Gly Glu Glu He Ser Tyr Ser Gly He Leu Asn Phe Asn Asn 
2050 2055 2060 
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GCA TAT ATA GGT TTG TCA TTA ATA AAT GAT GGT CAA TAT TAT TTT AAT 
Ala Tyr He Gly Leu Ser Leu He Asn Asp Gly Gin Tyr Tyr Phe Asn 
2100 2105 2110 

GAT GAT GGA ATT ATG CAA GTT GGA TTT GTC ACT ATA AAT GAT AAA GTC 
Asp Asp Gly He Met Gin Val Gly Phe Val Thr He Asn Asp Lys Val 
2115 2120 2125 



TTT GCA CAT CAA AAT ACT TTG GAT GAG AAT TTT GAG GGA GAA TCA ATA 
Phe Ala His Gin Asn Thr Leu Asp Glu Asn Phe Glu Gly Glu Ser He 
2305 2 310 2315 2320 

AAC TAT ACT GGT TGG TTA GAT TTA GAT GAA AAG AGA TAT TAT TTT ACA 
Asn Tyr Thr Gly Trp Leu Asp Leu Asp Glu Lys Arg Tvr Tyr Phe Thr 
2325 2330 * 2335 

GAT GAA TAT ATT GCA GCA ACT GGT TCA GTT ATT ATT GAT GGT GAG GAG 
Asp Glu Tyr He Ala Ala Thr Gly Ser Val He He Asp Gly Glu Giu 
2340 2345 2 350 

TAT TAT TTT GAT CCT GAT ACA GCT CAA TTA GTG ATT AGT GAA 
Tyr Tyr Phe Asp Pro Asp Thr Ala Gin Leu Val He Ser Glu 
2355 2360 2 3 6 5 



6336 



6384 



TTC TAC TTC TCT GAC TCT GGA ATT ATA GAA TCT GGA GTA CAA AAC ATA 64 32 

1" Phe Tyr Phe Ser Asp Ser Gly He He Glu Ser Gly Val Gin Asn He 
213 0 2135 2140 

GAT GAC AAT TAT TTC TAT ATA GAT GAT AAT GGT ATA GTT CAA ATT GGT 
Asp Asp Asn Tyr Phe Tyr He Asp Asp Asn Gly lie Val Gin lie Gly 
- " 145 2150 2155 2160 

GTA TTT GAT ACT TCA GAT GGA TAT AAA TAT TTT GCA CCT GCT AAT ACT 
Val Phe Asp Thr Ser Asp Gly Tyr Lys Tyr Phe Ala Pro Ala Asn Thr 
2() 2165 2170 2175 

GTA AAT GAT AAT ATT TAC GGA CAA GCA GTT GAA TAT AGT GGT TTA GTT 
val Asn Asp Asn lie Tyr Gly Gin Ala Val Glu Tyr Ser Gly Leu Val 
2180 2185 2190 

25 AGA GTT GGG GAA GAT GTA TAT TAT TTT GGA GAA ACA TAT ACA ATT GAG 

Arg Val Gly Glu Asp Val Tyr Tyr Phe Gly Glu Thr Tyr Thr lie Glu 
2195 2200 2205 

ACT GGA TGG ATA TAT GAT ATG GAA AAT GAA AGT GAT AAA TAT TAT TTC 66 72 

Thr Gly Trp He Tyr Asp Met Glu Asn Glu Ser Asp Lys Tyr Tyr Phe 
2210 2215 2220 

AAT CCA GAA ACT AAA AAA GCA TGC AAA GGT ATT AAT TTA ATT GAT GAT 

^? c Pl "° GlU Thr LyS LyS Ala Cys Gl y Ile Asn Leu He Asp Asp 

-° 2225 2230 2235 2240 

ATA AAA TAT TAT TTT GAT GAG AAG GGC ATA ATG AGA ACG GGT CTT ATA 
lie Lys Tyr Tyr Phe Asp Glu Lys Gly Ile Met Arg Thr Gly Leu He 
2245 2250 2255 



6480 



6528 



6576 



6624 



6720 



6768 



40 

TCA TTT GAA AAT AAT AAT TAT TAC TTT AAT GAG AAT GGT GAA ATG CAA 

Ser Phe Glu Asn Asn Asn Tyr Tyr Phe Asn Glu Asn Gly Glu Met Gin 
2260 2265 2270 

45 TTT GGT TAT ATA AAT ATA GAA GAT AAG ATG TTC TAT TTT GGT GAA GAT 

Phe Gly Tyr He Asn Ile Glu Asp Lys Met Phe Tvr Phe Gly Glu Asp 
2275 2280 * 2285 

GGT GT< 7 ATG CAG ATT GTA TTT AAT ACA CCA GAT GGA TTT AAA TAC 6 912 

>0 Gly Val Met Gin Ile Gly Val Phe Asn Thr Pro Asp Gly Phe Lys Tyr 

2290 2295 2300 



6816 



6864 



6960 



7008 



7056 



7098 
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(2) INFORMATION FOR SEQ ID NO:i0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2366 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ser Leu Val Asn Arg Lys Gin Leu Glu Lys Met Ala Asn Val Arq 
5 10^ 15 

Phe Arg Thr Gin Glu Asp Glu Tyr Val Ala He Leu Asp Ala Leu Glu 
20 -ytz 



30 



Glu Tyr His Asn Met Ser Glu Asn Thr Val Val Glu Lys Tyr Leu Lvs 
35 4 0 45 

Leu Lys Asp He Asn Ser Leu Thr Asp lie Tyr lie Asp Thr Tyr Lys 



60 



Lys Ser Gly Arg Asn Lys Ala Leu Lys Lys Phe Lys Glu Tyr Leu Val 

70 7 * 80 

Thr Glu Val Leu Glu Leu Lys Asn Asn Asn Leu Thr Pro Val Glu Lvs 
85 90 g5 ' 

Asn Leu His Phe Val Trp lie Gly Gly Gin He Asn Asp Thr Ala lie 
100 105 no 

Asn Tyr lie Asn Gin Trp Lys Asp Val Asn Ser Asp Tyr Asn Val Asn 
115 120 125 

Val Phe Tyr Asp Ser Asn Ala Phe Leu lie Asn Thr Leu Lys Lys Thr 
i - 5U 135 i 4 o 

Val Val Glu Ser Ala lie Asn Asp Thr Leu Glu Ser Phe Arg Glu Asn 
b 150 155 i 60 

Leu Asn Asp Pro Arg Phe Asp Tyr Asn Lys Phe Phe Arg Lys Arg Met 
165 170 175 

Glu He He Tyr Asp Lys Gin Lys Asn Phe He Asn Tyr Tyr Lys Ala 
180 185 190 

Gin Arg Glu Glu Asn Pro Glu Leu He He Asp Asp He Val Lys Thr 
195 200 205 

Tyr Leu Ser Asn Glu Tyr Ser Lys Glu He Asp Glu Leu Asn Thr Tyr 
* 10 215 2 20 

He Glu Glu Ser Leu Asn Lys He Thr Gin Asn Ser Gly Asn Asp Val 
"~ 5 230 235 240 

Arg Asn Phe Glu Glu Phe Lys Asn Gly Glu Ser Phe Asn Leu Tyr Glu 
245 250 255 

Gin Glu Leu Val Glu Arg Trp Asn Leu Ala Ala Ala Ser Asp He Leu 
260 265 270 

Arg lie Ser Ala Leu Lys Glu He Gly Gly Met Tyr Leu Asp Val Asd 
275 280 285 

Met Leu Pro Gly lie Gin Pro Asp Leu Phe Glu Ser He Glu Lys Pro 
290 295 



300 



Ser Ser Val Thr Val Asp Phe Trp Glu Met Thr Lys Leu Glu Ala He 
5 310 315 320 

- 250 



WO 98/08540 



PCT/US97/15394 



10 



15 



20 



40 



45 



50 



60 



65 



Met Lys Tyr Lys Glu Tyr He Pro Glu Tyr Thr Ser Glu 
325 330 

Met Leu Asp Glu Glu Val Gin Ser Ser Phe Glu Ser Val 
340 345 

Lys Ser Asp Lys Ser Glu He Phe Ser Ser Leu Gly Asp 
355 360 365 

Ser Pro Leu Glu Val Lys lie Ala Phe Asn Ser Lys Glv 
370 375 380 

Gin Gly Leu He Ser Val Lys Asp Ser Tyr Cys Ser Asn 
385 390 395 

Lys Gin He Glu Asn Arg Tyr Lys He Leu Asn Asn Ser 
405 410 

Ala He Ser Glu Asp Asn Asp Phe Asn Thr Thr Thr Asn 
420 425 

Asp Ser lie Met Ala Glu Ala Asn Ala Asp Asn Gly Arq 
4 35 440 44 | 

Glu Leu Gly Lys Tyr Leu Arg Val Gly Phe Phe Pro Asp 
450 455 460 

Thr He Asn Leu Ser Gly Pro Glu Ala Tyr Ala Ala Ala 
4 65 470 475 

Leu Leu Met Phe Lys Glu Gly Ser Met Asn He His Leu 
485 490 

Asp Leu Arg Asn Phe Glu He Ser Lys Thr Asn He Ser 
500 so5 

Glu Gin Glu Met Ala Ser Leu Trp Ser Phe Asp Asp Ala 
5 " 520 H ^ 525 

Ala Gin Phe Glu Glu Tyr Lys Arg Asn Tyr Phe Glu Gly 
530 535 5 4o 

Glu Asp Asp Asn Leu Asp Phe Ser Gin Asn He Val Val 
545 550 555 



His Phe Asp 
335 

Leu Ala Ser 
350 

Met Glu Ala 

He He Asn 

Leu lie Val 
400 

Leu Asn Pro 
415 

Thr Phe He 
430 

Phe Met Met 

Val Lys Thr 

Tyr Gin Asp 
480 

He Glu Ala 
495 

Gin Ser Thr 
510 

Arg Ala Lys 
Ser Leu Gly 



Asp Lys Glu 
560 



Tyr Leu Leu Glu Lys lie Ser Ser Leu Ala Arg Ser Ser 
565 570 

Tyr He His Tyr He Val Gin Leu Gin Gly Asp Lys He 
580 585 

Ala Ala Cys Asn Leu Phe Ala Lys Thr Pro Tyr Asp Ser 
595 600 6 o5 

Gin Lys Asn lie Glu Asp Ser Glu He Ala Tyr Tvr Tvr 
610 615 6 20 

Asp Gly Glu lie Gin Glu He Asp Lys Tyr Lys lie Pro 
625 630 635 

Ser Asp AVg Pro Lys lie Lys Leu Thr Phe lie Gly His 
645 650 

Glu Phe Asn Thr Asp He Phe Ala Gly Phe Asp Val Asp 
660 665 

Thr Glu lie Glu Ala Ala He Asp Leu Ala Lys Glu Asp 
675 6B0 685 



Glu Arg Glv 
575 

Ser Tyr Glu 
590 

Val Leu Phe 
Asn Pro Gly 



Ser lie lie 
640 

Gly Lys Asp 
655 

Ser Leu Ser 
670 

He Ser Pro 
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Lys Ser He Glu He Asn Leu Leu Gly Cys Asn Met Phe Ser Tyr Ser 
690 695 700 

lie Asn Val Glu Glu Thr Tyr Pro Gly Lys Leu Leu Leu Lys Val Lys 
05 710 715 720 

Asp Lys He Ser Glu Leu Met Pro Ser lie Ser Gin Asp Ser He He 
725 730 735 

Val Ser Ala Asn Gin Tyr Glu Val Arg He Asn Ser Glu Gly Arg Arg 
740 745 750 

Glu Leu Leu Asp His Ser Gly Glu Trp He Asn Lys Glu Glu Ser He 
|5 755 7 60 765 

He Lys Asp He Ser Ser Lys Glu Tyr He Ser Phe Asn Pro Lys Glu 
770 775 780 

Asn Lys lie Thr Val Lys Ser Lys Asn Leu Pro Glu Leu Ser Thr Leu 
785 790 795 eoo 

Leu Gin Glu He Arg Asn Asn Ser Asn Ser Ser Asp He Glu Leu Glu 
805 810 815 

2:> Glu Lys Val Met Leu Thr Glu Cys Glu lie Asn Val He Ser Asn He 

820 825 830 

Asp Thr Gin lie Val Glu Glu Arg He Glu Glu Ala Lvs Asn Leu Thr 
30 835 845 

Ser Asp Ser He Asn Tyr lie Lys Asp Glu Phe Lys Leu lie Glu Ser 
850 855 860 

^ lit ASP Ala LeU CyS Asp LeU ^ Gln Gln Asn Glu Leu Glu ASO 

865 870 875 880 

Ser His Phe He Ser Phe Glu Asp He Ser Glu Thr Asp Glu Gly Phe 
88 5 890 895 

Ser He Arg Phe He Asn Lys Glu Thr Gly Glu Ser He Phe Val Glu 
900 905 910 

Thr Glu Lys Thr lie Phe Ser Glu Tyr Ala Asn His He Thr Glu Glu 
915 920 925 

He Ser Lys lie Lys Gly Thr lie Phe Asp Thr Val Asn Glv Lys Leu 
930 935 940 

SO LyS LyS Val Asn LeU Asp Thr Thr Hls Glu Val Asn Thr Leu Asn 

• U 945 950 955 9 6 o 

Ala Ala Phe Phe He Gln Ser Leu He Glu Tyr Asn Ser Ser Lys Glu 
965 970 975 

:o Ser Leu Ser Asn Leu Ser Val Ala Met Lys Val Gln Val Tyr Ala Gln 

980 985 990 

Leu Phe ser Thr Gly Leu Asn Thr He Thr Asp Ala Ala Lys Val Val 
fi(J 995 1000 1005 

Glu Leu Val Ser Thr Ala Leu Asp Glu Thr He Asp Leu Leu Pro Thr 
1010 1015 1020 

^ c Ser G1U Gly Leu pro Ile Ile Ala Thr Ile Iie As P Gly Val Ser 
03 1025 • 103 0 1035 1040 

Leu Gly Ala Ala He Lys Glu Leu Ser Glu Thr Ser Asp Pro Leu Leu 
i045 1050 1055 



40 



45 
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Arg Gin Glu He Glu Ala Lys lie Gly He Met Ala Val Asn Leu Thr 
1060 1065 107 0 

Thr Ala Thr Thr Ala lie lie Thr Ser Ser Leu Gly lie Ala Ser Gly 

1075 1080 108 5 

^ ?I Ue LSU LCU Val Pr ° Leu Ala G1 V Ile Ala Gly lie Pro 

1090 1095 1100 

Ser Leu Val Asn Asn Glu Leu Val Leu Arg Asp Lys Ala Thr Lys Val 

1110 ins i 120 

Val Asp Tyr Phe Lys His Val Ser Leu Val Glu Thr Glu Gly Val Phe 
H25 H30 1135 

Thr Leu Leu Asp Asp Lys Ile Met Met Pro Gin Asp Asp Leu Val lie 
1140 1145 11S0 

Ser Glu Il| 5 Asp Phe Asn Asn Asn^Ser Ile Val Leu Gly Lys Cys Glu 



1165 



He Trp Arg Met Glu Gly Gly Ser Gly His Thr Val Thr Asp Asp He 
1170 U75 iiso F P S 

Asp His Phe Phe Ser Ala Pro Ser lie Thr Tyr Arg Glu Pro His Leu 

1190 H95 120 0 

Ser He Tyr Asp Val Leu Glu Val Cln Lys Glu Glu Leu Asp Leu Ser 
1 20S 1210 12 15 

LyS Asp Leu Val Leu Pro Asn Ala Pro Asn Arg Val Phe Ala Tro 

1220 1225 i 2 3o 

Glu Thr Gly Trp Thr Pro Gly Leu Arg Ser Leu Glu Asn Asp Glv Thr 
^■ i!> 1240 



1245 



Lys Leu Leu Asp Arg Ile Arg Asp Asn Tyr Glu Glv Glu Phe Tyr Trp 
1250 1255 126O P 

Arg Tyr Phe Ala Phe Ile Ala Asp Ala Leu Ile Thr Thr Leu Lvs Pro 

1270 1275 ' i280 

Arg Tyr Glu Asp Thr Asn lie Arg lie Asn Leu Asp Ser Asn Thr Arg 
1265 1290 i29 5 

Ser Phe lie Val Pro He lie Thr Thr Glu Tyr lie Arg Glu Lys Leu 
1 - so ° 1305 1310 

Ser Tyr Ser Phe Tyr Gly Ser Gly Gly Thr Tyr Ala Leu Ser Leu Ser 
1315 1320 1325 

Gin Tyr Asn Met Gly He Asn He Glu Leu Ser Glu Ser Asp Val Trp 

1335 i3 4 o 

lie lie Asp val Asp Asn Val Val Arg Asp Val Thr lie Glu Ser Asp 
1350 1355 i3 P 0 

Lys lie Lys Lys Gly Asp Leu He Glu Gly He Leu Ser Thr Leu Ser 
1365 i 370 1375 

He Glu Glu Asn Lys He He Leu Asn Ser His Glu He Asn Phe Ser 
1380 1385 1390 

Gly Glu Val__Asn Gly Ser Asn Gly^Phe Val Ser Leu Thr^Phe Ser lie 

Leu Glu Gly lie Asn Ala. lie lie Glu Val Asp Leu Leu Ser Lys Ser 

1415 1420 



Tyr Lys Leu Leu lie Ser Gly Glu Leu Lys lie Leu Met Leu Asn Ser 
1425 1430 1435 1440 

Asn His lie Gin Gin Lys lie Asp Tyr lie Gly Phe Asn Ser Glu Leu 
1445 1450 1455 

Gin Lys Asn lie Pro Tyr Ser Phe Val Asp Ser Glu Gly Lys Glu Asn 
1460 1465 1470 

Gly Phe lie Asn Gly Ser Thr Lys Glu Gly Leu Phe Val Ser Glu Leu 
1475 1480 1485 

Pro Asp Val Val Leu lie Ser Lys Val Tyr Met Asp Asp Ser Lvs Pro 
1490 1495 isoo 

Ser Phe Gly Tyr Tyr Ser Asn Asn Leu Lys Asp Val Lys Val He Thr 
1505 i5lD 1515 i5 2 o 

Lys Asp Asn Val Asn lie Leu Thr Gly Tyr Tyr Leu Lys Asp Asp lie 
1525 1530 1535 

Lys He Ser Leu Ser Leu Thr Leu Gin Asp Glu Lys Thr He Lys Leu 
1540 154 5 1550 

Asn Ser Val His Leu Asp Glu Ser Gly Val Ala Glu He Leu Lvs Phe 
i555 1560 1565 

Met Asn Arg Lys Gly Asn Thr Asn Thr Ser Asp Ser Leu Met Ser Phe 
1570 1575 1580 

Leu Glu Ser Met Asn He Lys Ser lie Phe Val Asn Phe Leu Gin Ser 
1585 1 59 0 1595 1600 

Ann He Lys Phe He Leu Asp Ala Asn Phe He lie Ser Gly Thr Thr 
1605 1610 1615 

Ser He Gly Gin Phe Glu Phe He Cys Asp Glu Asn Asp Asn II* Gin 
i620 1625 1630 

Pro Tyr Phe lie Lys Phe Asn Thr Leu Glu Thr Asn Tyr Thr Leu Tvr 
1635 1640 1645 

Val Gly Asn Arg Gin Asn Met He Val Glu Pro Asn Tyr Asp Leu Asp 
1650 1655 i 66 o 

Asp Ser Gly Asp He Ser Ser Thr Val He Asn Phe Ser Gin Lys Tyr 
1665 1670 1675 1680 

Leu Tyr Gly He Asp Ser Cys Val Asn Lys Val Val He Ser Pro Asn 
1685 1690 1695 

lie Tyr Thr Asp Glu He Asn He Thr Pro Val Tyr Glu Thr Asn Asn 
1700 1705 1710 

Thr Tyr Pro Glu Val He Val Leu Asp Ala Asn Tyr He Asn Glu Lys 
1715 1720 1725 

He Asn Val Asn He Asn Asp Leu Ser He Arg Tyr Val Trp Ser Asn 
1 73 0 1735 1740 

Asp Gly A5n Asp Phe He Leu Met Ser Thr Ser Glu Glu Asn Lys Val 
1745 1750 1755 1760 

Ser Gin Val Lys He Arg Phe Val Asn Val Phe Lys Asp Lys Thr Leu 
1765 1770 1775 

Ala Asn Lys Leu Ser Phe Asn Phe Ser Asp Lys Gin Asp Val Pro Val 
1780 1785 1790 
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Ser Glu He He Leu Ser Phe Thr Pro Ser Tyr Tyr Glu Asp Gly Leu 
1795 1800 X805 

He Gly Tyr Asp Leu Gly Leu Val Ser Leu Tyr Asn Glu Lvs Phe Tyr 
1810 1815 1820 

He Asn Asn Phe Gly Met Met Val Ser Gly Leu He Tyr He Asn Asp 
1825 1830 1835 1840 

Ser Leu Tyr Tyr Phe Lys Pro Pro Val Asn Asn Leu He Thr Gly Phe 
1846 1850 1855 

Val Thr Val Gly Asp Asp Lys Tyr Tyr Phe Asn Pro lie Asn Gly Gly 
15 1860 1865 1870 

Ala Ala Ser He Gly Glu Thr lie He Asp Asp Lys Asn Tyr Tvr Ph*» 
i87 * 1880 1885 

Asn Gin Ser Cly Val Leu Gin Thr Gly Val Phe Ser Thr Glu Asp Glv 
- u 1"° 1895 1900 

Phe Lys Tyr Phe Ala Pro Ala Asn Thr Leu Asp Glu Asn Leu Glu Glv 
1905 1910 i9i5 l9 y 2Q 

2> Glu Ala He Asp Phe Thr Gly Lys Leu He lie Asp Glu Asn He Tyr 

1925 1930 1935 

Tyr Phe Asp Asp Asn Tyr Arq Gly Ala Val Glu Trp Lys Glu Leu Asn 
M) i94 ° 1945 1950 

Gly Glu Met His Tyr Phe Ser Pro Glu Thr Gly Lys Ala Phe Lys Glv 
1955 i960 1965 

Leu Asn Gin He Gly Asp Tyr Lys Tyr Tyr Phe Asn Ser Asp Gly Val 
•° 1970 1975 i960 

Met Gin Lys Gly Phe Val Ser He Asn Asp Asn Lys His Tyr Phe Asp 
1985 1990 1995 2Q00 

Asp Ser Gly Val Met Lys Val Gly Tyr Thr Glu He Asp Gly Lys His 
2005 2010 2015 

Phe Tyr Phe Ala Glu Asn Gly Glu Met Gin He Gly Val Phe Asn Thr 
2020 2025 2030 

Glu Asp Gly Phe Lys Tyr Phe Ala His His Asn Glu Asp Leu Glv Asn 
2035 2040 2045 

Glu Glu Gly Glu Glu He Ser Tyr Ser Gly He Leu Asn Phe Asn Asn 
MJ 2 050 2055 2060 

Lys He Tyr Tyr Phe Asp Asp Ser Phe Thr Ala Val Val Gly Trp Lys 
2065 2070 2075 2080 

Asp Leu Glu Asp Gly Ser Lys Tyr Tyr Phe Asp Glu Asp Thr Ala Glu 
2085 2090 2095 

Ala Tyr He Gly Leu Ser Leu He Asn Asp Gly Gin Tyr Tyr Phe Asn 
2100 2105 2110 

Asp Asp Gly He Met Gin Val Gly Phe Val Thr Ho Asn Asp Lvs Vai 
2H5 2120 2125 

Phe Tyr Phe Ser Asp Ser Gly He He Glu Ser Gly Val Gin Asn lie 
2130 217.5 2140 

Asp Asp Asn Tyr Phe Tyr He Asp Asp Asn Gly He Val Gin lie Gly 
2145 2150 2155 2160 

70 val Phe Asp Thr Ser Asp Gly Tyr Lys Tyr Phe Ala Pro Ala Asn Thr 
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2165 2170 2175 

Val Asn Asp AsnUe Tyr Gly Gin Ala^Val Glu Tyr Ser Gly Leu Val 



2190 



Arg Val Gly Glu Asp Val Tyr Tyr Phe Gly Glu Thr Tyr Thr He Glu 

^ 00 2205 

Thr Gly Trp lie Tyr Asp Met Glu Asn Glu Ser Asp Lys Tyr Tyr P he 

a215 2220 

Asn Pro Glu Thr Lys Lys Ala Cv iv«s riw ti^ * 

2225 2230 V ^35 6U Ile As P As P 



2240 

He Lys Tyr Tyr Phe Asp Glu Lys Gly lie Met Arg Thr Gly Leu He 

2250 2255 
Ser Phe Glu Asn Asn Asn Tyr Tyr Phe Asn n., n^ ^ 

20 2265 2270 

Phe Gly Tyr lie Asn He Glu Asp Lys Met Phe Tyr Phe Gly 01u Asp 

<- 280 2285 

25 G1V 2290 M6t UC GI/ ™ Phe Asn Thr P - Asp Gly Phe Lys Tyr 

2300 

Phe Ala Hi. Gin Asn Thr Leu Asp Glu Asn Phe Glu Gly Glu Ser lie 
' 31 ° 23 " 2320 



Asn Tyr Thr Gly Trp Leu Asp Leu Asp Glu Lys Arg Tyr Tyr Phe Thr 

2330 2335 
As P Glu Tyr He Ala Ala Thr Gly Ser Val He He Asp Gly Glu Glu 

2345 2350 
Tyr Tyr Phe Asp Pro Asp Thr Ala Gin Leu Val lie Ser Glu 

" 355 2360 2365 

(.*?) INFORMATION FOR SEQ ID NO: 11: 
( 



( l ) SEQUENCE CHARACTERISTICS • 
(A) LENGTH: 19 base pairs 
<B) TYPE : nucleic acid 
a <: *C) STRANDEDNESS : single 

H " <D> TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: II: 

TAGAAAAAAT GGCAAATGT 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS • 
(A) LENGTH ; 21 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 12: 

fa TTTCATCTTG TAGACTCAAA G 

(2) INFORMATION FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS ■ 
(A) LENGTH: 22 base pairs 
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(B> TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GATGCCACAA GATGATTTAG TG 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xij SEQUENCE DESCRIPTION: SEQ ID NO:14: 

CTAATTGAGC TGTATCAGGA TC 

(2) INFORMATION FOR SEQ ID NO: 15: 

'.if SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: DNA (genomic) 

(:<i) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CGGAATTCCT AGAAAAAATG GCAAATG 

(2) INFORMATION FOR SEQ ID NO: 16: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingl e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(:<i) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GCTCTAGAAT GACCATAAGC TAGCCA 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Ki) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CGGAATTCGA GTTGGTAGAA AGGTGGA 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

CGGAATTCGG TTATTATCTT AAGGATG 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2B base pairs 
fB) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Hi) MOLECULE TYPE : DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CGGAATTCTT GATAACTGGA TTTGTGAC 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 511 amino acids 

(B) TYPE: ammo acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(il) MOLECULE TYPE: protein 

fxi) SEQUENCE DESCRIPTION : SEQ ID NO: 20: 

Leu He Thr Gly Phe Val Thr Val Gly Asp Asp Lys Tyr Tyr Phe Asn 
5 10 15 

Pro lie Asn Gly' Gly Ala Ala Ser He Gly Glu Thr lie He Asp Asp 
20 25 30 

Lys Asn Tyr Tyr Phe Asn Gin Ser Gly Val Leu Gin Thr Glv Val Phe 
35 40 45 

Ser Thr Glu Asp Gly Phe Lys Tyr Phe Ala Pro Ala Asn Thr Leu Asp 
iU 55 eo 

Glu Asn Leu Glu Gly Glu Ala He Asp Phe Thr Gly Lys Leu He lie 
65 70 75 80 

Asp Glu Asn He Tyr Tyr Phe Asp Asp Asn Tyr Arg Gly Ala Val Glu 
85 90 g5 

Trp Lys Glu Leu Asp Gly Glu Met His Tyr Phe Ser Pro Glu Thr Glv 
100 105 110 

Lys Ala Phe Lys Gly Leu Asn Gin He Gly Asp Tyr Lys Tvr Tyr Phe 
115 120 125 * 

Asn Ser Asp Gly Val Met Gin Lys Gly Phe Val Ser He Asn Asp Asn 
130 135 i4Q 

Lys His Tyr Phe Asp Asp Ser Gly Val Met Lys Val Gly Tvr Thr Glu 
45 150 155 ' 160 

He Asp Gly Lys His Phe Tyr Phe Ala Glu Asn Gly Glu Met Gin He 
165 1 7 0 175 

Gly Val Phe Asn Thr Glu Asp Gly Phe Lys Tyr Phe Ala His His Asn 
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180 185 



190 



Glu Asp Leu Gly Asn Glu Glu Gly Glu Glu He Ser Tyr Ser Gly lie 
195 200 205 

Leu Asn Phe Asn Asn Lys He Tyr Tyr Phe Asp Asp Ser Phe Thr Ala 
'■ LU 215 220 

Val Val Gly Trp Lys Asp Leu Glu Asp Gly Ser Lys Tyr Tyr Phe Asd 

230 235 24 5 

Glu Asp Thr Ala Glu Ala Tyr lie Gly Leu Ser Leu He Asn Asp Gly 

2Sr 255 

Gin Tyr Tyr Phe Asn Asp Asp Gly He Met Gin Val Gly Phe Val Thr 
260 265 270 

He Asn Asp Lys Val Phe Tyr Phe Ser Asp Ser Gly He He Glu Ser 

230 285 

Gly Val Gin Asn He Asp Asp Asn Tyr Phe Tyr He Asp Asp Asn Gly 

295 300 

He Val Gin lie Gly Val Phe Asp Thr Ser Asp Gly Tyr Lys Tyr Phe 

10 315 3 2o 

Ala Pro Ala Asn Thr Val Asn Asp Asn He Tyr Gly Gin Ala Val Glu 
325 330 335 

Tyr Ser Gly Leu Val Arg Val Gly Glu Asp Val Tyr Tyr Phe Gly Glu 
340 345 3 50 

Thr Tyr Thr He Glu Thr Gly Trp He Tyr Asp Met Glu Asn Glu Ser 
b 360 365 

Asp Lys Tyr Tyr Phe Asn Pro Glu Thr Lys Lys Ala Cys Lys Glv lie 
U 375 

Asn Leu He Asp Asp lie Lys Tyr Tyr Phe Asp Glu Lys Gly He Met 

390 395 400 

Arg Thr Gly Leu lie Ser Phe Glu Asn Asn Asn Tyr Tyr Phe Asn Glu 
405 410 415 

Asn Giv Glu Met Gin Phe Gly Tyr lie Asn He Glu Asp Lvs Mcr Phe 
420 125 ., 30 

Tyr Phe Gly Glu Asp Gly Val Met Gin He Giy Val Phe Asr. Thr Pro 

440 



445 



Asp Gly Phe Lys Tyr Phe Ala His Gin Asn Thr Leu Asp Glu Asn Phe 

455 460 

Glu Gly Glu Ser lie Asn Tyr Thr Gly Trp Leu Asp Leu Asp Glu Lys 

' 475 480 

Arg Tyr Tyr Phe Thr Asp Glu Tyr He Ala Ala Thr Gly Ser Val lie 

485 ™° 495 

He Asp Gly Glu Glu Tyr Tyr Phe Asp Pro Asp Thr Ala Gin Leu 
500 S05 510 
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(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 08 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 
< D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Ser Glu Glu Asn Lys Val Ser Gin Val Lys He Arg Phe Val Asn Val 
1 5 10 15 

Phe Lys Asp Lys Thr Leu Ala Asn Lys Leu Ser Phe Asn Phe Ser Aso 
20 25 30 

Lys Gin Asp Val Pro Val Ser Glu He He Leu Ser Phe Thr Pro Ser 
35 40 45 

Tyr Tyr Glu Asp Gly Leu He Gly Tyr Asp Leu Gly Leu Val Ser Leu 
50 55 60 

Tyr Asn Glu Lys Phe Tyr lie Asn Asn Phe Gly Met Met Val Ser Gly 
65 7 ° 75 ao 

Leu He Tyr He Asn Asp Ser Leu Tyr Tyr Phe Lys Pro Pro Val Asn 
85 go 95 

Asn Leu He Thr Gly Phe Val Thr Val Gly Asp Asp Lys Tyr Tyr Phe 
100 105 no 

Asn Pro He Asn Gly Gly Ala Ala Ser lie Glv Glu Thr He lie Asp 
US 120 ' 125 

Asp Lys Asn Tyr Tyr Phe Asn Gin Ser Gly Val Leu Gin Thr Gly Val 
130 135 140 

Phe Ser Thr Glu Asp Gly Phe Lys Tyr Phe Ala Pro Ala Asn Thr Leu 
145 150 155. 160 

Asp Glu Asn Leu Glu Gly Glu Ala He Asp Phe Thr Gly Lys Leu He 
165 170 175 

He Asp Glu Asn He Tyr Tyr Phe Asp Asp Asn Tyr Arg Glv Ala Va^ 
180 185 190 

Glu Trp Lys Glu Leu Asp Gly Glu Met His Tyr Phe Ser Pro Glu Thr 
195 200 205 

Gly Lys Ala Phe Lys Gly Leu Asn Gin He Gly Asp Tyr Lys Tyr Tvr 
210 215 220 

Phe Asn Ser Asp Gly Val Met Gin Lys Gly Phe Val Ser He Asn Asn 
225 230 235 240 

Asn Lys His Tyr Phe Asp Asp Ser Gly Val Met Lys Val Gly Tyr Thr 
245 250 255 

Glu lie Asp Gly Lys His Phe Tyr Phe Ala Glu Asn Gly Glu Met Gin 
260 265 270 

He Gly Val Phe Asn Thr Glu Asp Gly Phe Lys Tyr Phe Ala His Hi"> 
275 260 285 

Asn Glu Asp Leu Gly Asn Glu Glu Gly Glu Glu lie Scr Tyr Ser Glv 
290 295 300 

lie Leu Asn Phe Asn Asn Lys He Tyr Tyr Phe Asp Asp Ser Phe Thr 

- 260 - 
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320 



305 310 3i5 

Ala Val Val Gly Trp Lys Asp Leu Glu Asp Gly Ser Lys Tyr Tyr Phe 
325 330 335 

Asp Glu Asp Thr Ala Glu Ala Tyr lie Gly Leu Ser Leu He Asn Asd 
340 345 350 H 

Gly Gin Tyr Tyr Phe Asn Asp Asp Gly lie Met Gin Val Gly Phe Val 
355 360 365 

Thr lie Asn Asp Lys Val Phe Tyr Phe Ser Asp Ser Gly He lie Glu 
370 375 3 80 

Ser Gly Val Gin Asn lie Asp Asp Asn Tyr Phe Tyr He Asp Asp Asn 
390 395 400 

Gly He Val Gin lie Gly Val Phe Asp Thr Ser Asp Gly Tyr Lys Tvr 

20 405 410 4i5 ' 

Phe Ala Pro Ala Asn Thr Val Asn Asp Asn He Tyr Gly Gin Ala Val 

420 425 4 3 o 

^ GlU Tyr ?" Gly Leu Val Ar ? Val Gly Glu Asp Val Tyr Tyr Phe Gly 

*"* 435 440 - - - 



15 



30 



40 



45 



60 



65 



70 



445 



Glu Tnr Tyr Thr lie Glu Thr Gly Trp He Tyr Asp Met Glu Asn Glu 

455 4S0 

Ser Asp Lys Tyr Tyr Phe Asn Pro Glu Thr Lys Lys Ala Cys Lys Gly 
465 470 475 48u 

He Asn Leu lie Asp Asp lie Lys Tyr Tyr Phe Asp Glu Lys Gly He 
35 485 490 4 95 

Met Arg Thr Gly Leu He Ser Phe Glu Asn Asn Asn Tyr Tyr Phe Asn 
500 505 510 

Glu Asn Gly Glu Met Gin Phe Gly Tyr lie Asn lie Glu Asp Lys Met 
515 520 525 ' 

Phe Tyr Phe Gly Glu Asp Gly Val Met Gin lie Gly Val Phe Asn Thr 
J 535 5 4 o 

Pro Asp Gly Phe Lys Tyr Phe Ala His Gin Asn Thr Leu Asd Glu Asn 
550 555 • 560 

Phe Glu Gly Glu Ser lie Asn Tyr Thr Gly Trp Leu Asp Leu Asp Glu 
50 565 570 57S 

Lys Arg Tyr Tyr Phe Thr Asp Glu Tyr He Ala Ala Thr Gly Ser Val 



585 59 0 

Phe Asp Pro Asp Thr 
600 605 



He He Asp Gly Glu Glu Tyr Tyr Phe Asp Pro Asp Thr Ala Gin Leu 
(2) INFORMATION FOR SEQ ID NO: 22: 



fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1330 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1 , . 1314 
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45 



50 



60 



65 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

ATG GCT CGT CTG CTC TCT ACC TTC ACT GAA TAC ATC AAG AAC ATC ATC 
Met Ala Arg Leu Leu Ser Thr Phe Thr Glu Tyr He Lys Asn He lie 
1 5 io 



15 



AAT ACC TCC ATC CTG AAC CTG CGC TAC GAA TCC AAT CAC CTG ATC GAC 
Asn Thr Ser He Leu Asn Leu Arg Tyr Glu Ser Asn His Leu He Asd 
20 25 30 

CTG TCT CGC TAC GCT TCC AAA ATC AAC ATC GGT TCT AAA GTT AAC TTC 
Leu Ser Arg Tyr Ala Ser Lys He Asn He Gly Ser Lys Val Asn Phe 
35 40 45 

GAT CCG ATC GAC AAG AAT CAG ATC CAG CTG TTC AAT CTG GAA TCT TCC 
Asp Pro lie Asp Lys Asn Gin lie Gin Leu Phe Asn Leu Glu Ser Ser 
50 55 60 

^ ATC GAA GTT ATC CTG AAG AAT GCT ATC GTA TAC AAC TCT ATG TAC 

/al Tyr Asn Ser Met Tyr 
75 80 



4B 



96 



144 



192 



240 



yjl 1 mj\Ks AA l <JCT ATC GTA TAC AAC TCT ATG TAC 

Lys He Glu Val He Leu Lys Asn Ala lie Val Tyr Asn Ser Met Tyr 
65 7Q 1 

GAA AAC TTC TCC ACC TCC TTC TGG ATC CGT ATC CCG AAA TAC TTC AAC 2fifl 
Glu Asn Phe Ser Thr Ser Phe Trp lie Arg He Pro Lys Tyr Phe Asn 
85 90 95 

TCC ATC TCT CTG AAC AAT GAA TAC ACC ATC ATC AAC TGC ATG GAA AAC 3 36 

Ser He Ser Leu Asn Asn Glu Tyr Thr lie He Asn Cys Met Glu Asn 
M) 100 105 no 

AAT TCT GGT TGG AAA GTA TCT CTG AAC TAC GGT GAA ATC ATC TGG ACT 3 84 

Asn Ser Gly Trp Lys Val Ser Leu Asn Tyr Gly Glu He He Trp Thr 
115 120 125 

3> CTG CAG GAC ACT CAG GAA ATC AAA CAG CGT GTT GTA TTC AAA TAC TCT 4 32 

Leu Gin Asp Thr Gin Glu He Lys Gin Arg Val Val Phe Lys Tyr Ser 
130 135 140 

CAG ATG ATC AAC ATC TCT GAC TAC ATC AAT CGC TGG ATC TTC GTT ACC 4 80 

**" Gin Met He Asn lie Ser Asp Tyr lie Asn Arg Trp lie Phe Val Thr 

145 150 15 5 160 

ATC ACC AAC AAT CGT CTG AAT AAC TCC AAA ATC TAC ATC AAC GGC CGT 52 8 

lie Thr Asn Asn Arg Leu Asn Asn Ser Lys lie Tyr He Asn Gly Arg 
165 170 175 

CTG ATC GAC CAG AAA CCG ATC TCC AAT CTG GGT AAC ATC CAC GCT T^- 5 7 6 
Leu lie Asp Gin Lys Pro He Ser Asn Leu Gly Asn He His Ala Ser 
180 185 190 

AAT AAC ATC ATG TTC AAA CTG GAC GGT TGT CGT GAC ACT CAC CGC TAC 624 
Asn Asn lie Met Phe Lys Leu Asp Gly Cys Arg Asp Thr His Arg Tyr 
195 200 205 

:0 ATC TGG ATC AAA TAC TTC AAT CTG TTC GAC AAA GAA CTG AAC GAA AAA 6 72 

lie Trp lie Lys Tyr Phe Asn Leu Phe Asp Lys Glu Leu Asn Glu Lys 
210 215 220 

GAA ATC AAA GAC CTG TAC GAC AAC CAG TCC AAT TCT GGT ATC CTG AAA 7^0 
Glu He Lys Asp Leu Tyr Asp Asn Gin Ser Asn Ser Gly He Leu Lv«3 
225 * 230 235 240 

GAC TTC TGG GGT GAC TAC CTG CAG TAC GAC AAA CCG TAC TAC ATG CTG 76 8 

Asp Phe Trp Gly Asp Tyr Leu Gin Tyr Asp Lys Pro Tyr Tyr Met Leu 
245 250 255 

AAT CTG TAC GAT CCG AAC AAA TAC GTT GAC GTC AAC AAT GTA GGT ATC 816 
Asn Leu Tyr Asp Pro Asn Lys Tyr Val Asp Val Asn Asn Val Gly He 
7() 260 265 2 70 
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CGC GGT TAC ATG TAC CTG AAA GGT CCG CGT GGT TCT GTT ATG ACT ACC 664 
Arg Gly Tyr Met Tyr Leu Lys Gly Pro Arg Gly Ser Val Met Thr Thr 
275 280 265 

AAC ATC TAC CTG AAC TCT TCC CTG TAC CGT GGT ACC AAA TTC ATC ATC 912 
Asn He Tyr Leu Asn Ser Ser Leu Tyr Arg Gly Thr Lys Phe He He 
290 295 300 

AAG AAA TAC GCG TCT GGT AAC AAG GAC AAT ATC GTT CGC AAC AAT GAT 96 0 

Lys Lys Tyr Ala Ser Gly Asn Lys Asp Asn He Val Arg Asn Asn Asp 
305 310 315 320 

CGT GTA TAC ATC AAT GTT GTA GTT AAG AAC AAA GAA TAC CGT CTG GCT 1008 
Arg Val Tyr He Asn Val Val Val Lys Asn Lys Glu Tyr Arg Leu Ala 
325 330 335 

ACC AAT GCT TCT CAG GCT GGT GTA GAA AAG ATC TTG TCT GCT CTG GAA 10 56 

Thr Asn Ala Ser Gin Ala Gly Val Glu Lys He Leu Ser Ala Leu Glu 
340 345 350 

ATC CCG GAC GTT GGT AAT CTG TCT CAG GTA GTT GTA ATG AAA TCC AAG U04 
lie Pro Asp Val Gly Asn Leu Ser Gin Val Val Val Met Lys Ser Lys 
355 360 365 

AAC GAC CAG GGT ATC ACT AAC AAA TGC AAA ATG AAT CTG CAG GAC AAC 1152 
Asn Asp Gin Gly He Thr Asn Lvs Cvs Lys Met Asn Leu Gin Asp Asn 
370 375 380 

AAT GGT AAC GAT ATC GGT TTC ATC GGT TTC CAC CAG TTC AAC AAT ATC 1200 
Asn Gly Asn Asp lie Gly Phe He Gly Phe His Gin Phe Asn Asn He 
385 390 395 400 

GCT AAA CTG GTT GCT *TCC AAC TGG TAC AAT CGT CAG ATC GAA CGT TCC 124 8 

Ala Lys Leu Val Ala Ser Asn Trp Tyr Asn Arg Gin lie Glu Arg Ser 
405 410 415 

TCT CGC ACT CTG GGT TGC TCT TGG GAG TTC ATC CCG GTT GAT GAC GGT 12 96 

Ser Arg Thr Leu Gly Cys Ser Trp Glu Phe He Pro Val Asp Asp Glv 
420 425 430 

TGG GGT GAA CGT CCG CTG TAACCCGGGA AAGCTT 1330 
Trp Gly Glu Arg Pro Leu 
435 

45 (2) INFORMATION FOR SEQ ID NO: 23: 



15 



20 



30 



40 



50 



60 



65 



( I ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 438 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(iii MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23; 



Met Ala Arg Leu Leu Ser Thr Phe Thr Glu Tyr He Lys Asn lie He 

15 10 15 

Asn Thr Ser He Leu Asn Leu Arg Tyr Glu Ser Asn His Leu He Asd 
20 25 30 

Leu Ser Arg Tyr Ala Ser Lys lie Asn He Gly Ser Lys Val Asn Phe 
35 40 45 

Asp Pro lie Asp Lys Asn Gin He Gin Leu Phe Asn Leu Glu Ser Ser 
50 55 60 

Lys He Glu Val He Leu Lys Asn Ala lie Val Tyr Asn Ser Met Tvr 
70 65 ™ 75 eo 
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10 



20 



65 



70 



Glu Asn Phe Ser Thr Ser Phe Trp He Arg lie Pro Lys Tyr Phe Asn 

85 90 95 

Ser He Ser Leu Asn Asn Glu Tyr Thr lie He Asn Cys Met Glu Asn 

Asn Ser Gly Trp Lys Val Ser Leu Asn Tyr Gly Glu lie lie Trp Thr 

Leu Gin Asp Thr Gin Glu lie Lys Gin Arg Val Val Phe Lys Tyr Ser 

135 140 

Gin Met lie Asn He Ser Asp Tyr He Asn Arg Trp lie Phe Val Thr 

150 155 160 

He Thr Asn Asn Arg Leu Asn Asn Ser Lys He Tyr lie Asn Gly Arq 
165 170 7 a 



175 



Leu lie Asp Gin Lys Pro lie Ser Asn Leu Gly Asn lie His Ala Ser 
180 185 190 

Asn Asn lie Met Phe Lys Leu Asp Gly Cys Arg Asp Thr His Arg Tyr 



05 



He Trp He Lys Tyr Phe A Leu phe Asp Lys ^ ^ ^ ^ 

- 15 220 
Glu lie Lys Asp Leu Tyr Asp Asn Gin Ser Asn Ser Gly lie Leu Lys 
30 230 2 " 24o 

Asp Phe Trp Gly Asp Tyr Leu Gin Tyr Asp Lys Pro Tyr Tyr Met Leu 

250 255 
, . Asn Leu Tyr Asp Pro Asn Lys Tyr Val Asp Val Asn Asn Val Gly He 

265 270 

Arg Gly Tyr Met Tyr Leu Lys Gly Pro Arg Gly Ser Val Met Thr Thr 
" 75 =80 285 

40 Asn lie Tyr Leu Asn Ser Ser Leu Tyr Arg Gly Thr Lvs Phe Ue lie 
* au 295 300 

Lys Lys Tyr Ala Ser Gly Asn Lys Asp Asn He Val Arg Asn Asn Asp 

45 " U0 315 320 

Arg Val Tyr He Asn Val Val Val Lys Asn Lys Glu Tyr Arg Leu Ala 
325 330 335 

Thr Asn Ala Ser Gin Ala Gly Val Glu Lys He Leu Ser Ala Leu Glu 
340 345 350 

He Pro Asp Val Gly Asn Leu Ser Gin Val Val Val Met Lys Ser Lys 
5 360 365 

55 Asn Asp Gin Gly He Thr Asn Lys Cys Lys Met Asn Leu Gin Asp Asn 

375 380 

Asn Gly Asn Asp lie Gly Phe Ue Gly Phe His Gin Phe Asn Asn He 
60 390 3 95 400 

Ala Lys Leu Val Ala Ser Asn Trp Tyr Asn Arg Gin lie Glu Arg Ser 
405 410 41 | 

Ser Arg Thr Leu Gly Cys Ser Trp Glu Phe He Pro Val Asp Asp Gly 



420 42 5 

Trp Gly Glu Arg Pro Leu 
435 

(2) INFORMATION FOR SEQ ID NO: 24: 



430 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

Met Gly His His His His His His His His His His Ser Ser Gly Hi- 

1 5 io 15 y 

He Glu Gly Arg His Met Ala 
20 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1402 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1 . . 1386 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

mJ? r?^ 5^ 5 AT CAT CAT CAT CAT CAT CAT CAC AGC AG ^ GGC CAT 4 8 

Met Gly His His His His His His His His His His Ser Ser Gly H^s 

5 10 15 

n f CGT CAT ATG GCT AGC ATG GCT CGT CTG CTG TCT ACC T t C 

He Glu Gly Arg His Met Ala Ser Met Ala Arg Leu Leu Ser Thr Pne 
20 25 30 

Thr G^ Tvr t£ f C tT° A T C ™ ACC TCC ATC CTG ^ C CTG GGG 

Tni Glu Tyr lie Lys Asn He lie Asn Thr Ser lie Leu Asn Leu Arq 

35 40 45 y 

TAC GAA TCC AAT CAC CTG ATC GAC CTG TCT CGC TAC GCT TCC AAA ATC 
Tyr Glu Ser Asn His Leu He Asp Leu Ser Arg Tyr Ala Ser lie 

55 60 

£n tT° n? T I CT *** GTT ** C TTC GAT CCG ATC GAC AAG AAT CAG ATC 
Asn lie Gly Ser Lys Val Asn Phe Asp Pro lie Asp Lys Asn Gin lie 

70 75 so 

CAG CTG TTC AAT CTG GAA TCT TCC AAA ATC GAA GTT ATC CTG AAG AAT 
Gin Leu Phe Asn Leu Glu Ser Ser Lys lie Glu Val lie Leu Lys aJJ 
85 90 i 5 

GCT ATC GTA TAC AAC TCT ATG TAC GAA AAC TTC TCC ACC TCC TTC rrr 
Ala lie Val Tyr Asn Ser Met Tyr Glu Asn Phe Ser Thr ler Phe Trp 
100 105 il0 

ATC CGT ATC CCG AAA TAC TTC AAC TCC ATC TCT CTG AAC AAT GAA TAC 
He Arg le Pro Lys Tyr Phe Asn Ser He Ser Leu Asn Asn Glu T^r 
115 120 125 

Thr n f tT° TGC ATG GAA ** C *** TCT GGT TGG AAA GTA TCT CTG 

Thr lie lie Asn Cys Met Glu Asn Asn Ser Gly Trp Lys Val Ser Leu 

1JU !35 140 

AAC TAC GGT GAA ATC ATC TGG ACT CTG CAG GAC ACT CAG GAA ATC AAA 
Asn Tyr Gly Glu lie lie Trp Thr Leu Gin Asp Thr Gin G^ He 
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96 



144 



192 



240 



288 



336 



384 



432 



480 



WO 98/08540 



PCT/US97/15394 



145 150 



155 160 



CAG CGT GTT GTA TTC AAA TAC TCT CAG ATG ATC AAC ATC TCT GAC TAC 
Gin Arg val Val Phe Lys Tyr Ser Gin Met lie Asn He Ser Asp Tyr 
165 170 175 

ATC AAT CGC TGG ATC TTC GTT ACC ATC ACC AAC AAT CGT CTG AAT AAC 
He Asn Arg Trp lie Phe Val Thr He Thr Asn Asn Arg Leu Asn Asn 
180 185 190 

TCC AAA ATC TAC ATC AAC GGC CGT CTG ATC GAC CAG AAA CCG ATC TCC 
Ser Lys He Tyr He Asn Gly Arg Leu He Asp Gin Lys Pro lie Ser 
195 200 205 

AAT CTG GGT AAC ATC CAC GCT TCT AAT AAC ATC ATG TTC AAA CTG GAC 
Asn Leu Gly Asn He His Ala Ser Asn Asn He Met Phe Lys Leu Asd 
210 215 220 

GGT TGT CGT GAC ACT CAC CGC TAC ATC TGG ATC AAA TAC TTC AAT CTG 
Gly Cys Arg Asp Thr His Arg Tyr lie Trp He Lys Tyr Phe Asn Leu 
225 230 2 35 2 40 

TTC GAC AAA GAA CTG AAC GAA AAA GAA ATC AAA GAC CTG TAC GAC AAC 
Phe Asp Lys Glu Leu Asn Glu Lys Glu He Lys Asp Leu Tyr Asp Asn 
245 250 255 

CAG TCC AAT TCT GGT ATC CTG AAA GAC TTC TGG GGT GAC TAC CTG CAG 
Ciln Ser Asn Ser Gly He Leu Lys Asp Phe Trp Gly Asp Tyr Leu Gin 
260 265 270 

TAC GAC AAA CCG TAC TAC ATG CTG AAT CTG TAC GAT CCG AAC AAA TAC 
Tyr Asp Lys Pro Tyr Tyr Met Leu Asn Leu Tyr Asp Pro Asn Lys Tyr 
275 280 285 

GTT GAC GTC AAC AAT GTA GGT ATC CGC GGT TAC ATG TAC CTG AAA GGT 
Val Asp Val Asn Asn Val Gly He Arg Gly Tyr Met Tyr Leu Lys Glv 
290 295 300 

CCG CGT GGT TCT GTT ATG ACT ACC AAC ATC TAC CTG AAC TCT TCC CTG 
Pro Arg Gly Ser Val Met Thr Thr Asn He Tyr Leu Asn Ser Ser Leu 
305 310 315 320 

TAC CGT GGT ACC AAA TTC ATC ATC AAG AAA TAC GCG TCT GGT AAC AAG 
Tyr Arg Gly Thr Lys Phe He He Lys Lys Tyr Ala Ser Gly Asn Lys 
325 330 335 

GAC AAT ATC GTT CGC AAC AAT GAT CGT GTA TAC ATC AAT GTT GTA GTT 
Asp Asn He Val Arg Asn Asn Asp Arg Val Tyr lie Asn Val Val Val 
340 345 350 

AAG AAC AAA GAA TAC CGT CTG GCT ACC AAT GCT TCT CAG GCT GGT GTA 
Lys Asn Lys Glu Tyr Arg Leu Ala Thr Asn Ala Ser Gin Ala Gly Val 
355 360 365 

GAA AAG ATC TTG TCT GCT CTG GAA ATC CCG GAC GTT GGT AAT CTG TCT 
Glu Lys He Leu Ser Ala Leu Glu lie Pro Asp Val Gly Asn Leu Se>- 
370 375 380 

CAG GTA GTT GTA ATG AAA TCC AAG AAC GAC CAG GGT ATC ACT AAC AAA 
Gin Val Val Val Met Lys Ser Lys Asn Asp Gin Glv He Thr Asn Lys 
385 * 390 395 ' 400 

TGC AAA ATG AAT CTG CAG GAC AAC AAT GGT AAC GAT ATC GGT TTC ATC 
Cys Lys Met Asn Leu Gin Asp Asn Asn Gly Asn Asp He Gly Phe He 
405 410 415 
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20 



:0 



50 





TTC 


CAC 


CAG 


TTC 


AAC 


AAT ATC 


GCT 


Gly 




Hie 




Phe 


Asn 


Asn He 


Ala 








4 2 0 








425 


TAC 


AAT 


CGT 


CAG 


ATC 


GAA 


CGT TCC 


TCT 


Tyr 


Asn 


Arg 


Gin 


He 


Glu 


Arg Ser 


Ser 






435 








440 




GAG 


TTC 


ATC 


CCG 


GTT 


GAT 


GAC GGT 


TGG 


Glu 


Phe 


He 


Pro 


Val 


Asp 


Asp Gly Trp 




400 










455 





i CTG GTT GCT TCC AAC TGG 12 96 

i Leu Val Ala Ser Asn Trp 
430 

. ACT CTG GGT TGC TCT TGG i344 

Thr Leu Gly Cys Ser Trp 
445 

If) r*\ ~ r,i~ V\"~ w " WA * w "* VJV51 ^ l GAA CGT CCG CTG i -> 

10 Glu Phe He Pro VaI a,o ion t*-- m.. Giu Arg prQ Lfiu 1396 

460 

TAACCCGGGA AAGCTT 
15 (2) INFORMATION FOR SEQ ID NO: 26: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 462 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii} MOLECULE TYPE: protein 

(xiJ SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Gly His His His His His His His His His His Ser Ser Glv His 

1 S 10 - 1 



15 



He Glu Gly Arg His Met Ala Ser Met Ala Arg Leu Leu Ser Thr Phe 
20 25 30 

Thr Glu Tyr He Lys Asn He lie Asn Thr Ser He Leu Asn Leu Arg 
35 40 45 ^ 

Tyr Glu Ser Asn His Leu He Asp Leu Ser Arg Tyr Ala Scr Lys He 
50 55 go 

Asn lie Gly Ser Lys Val Asn Phe Asp Pro He Asp Lys Asn Gin He 
40 70 75 80 

Gin Leu Phe Asn Leu Glu Ser Ser Lys He Glu Val He Leu Lys Asn 
85 90 95 

Ala He Val Tyr Asn Ser Met Tyr Glu Asn Phe Ser Thr Ser Phe Trp 
100 105 110 

He Arg lie Pro Lys Tyr Phe Asn Ser He Ser Leu Asn Asn Glu Tyr 
H5 120 125 

Thr lie He Asn Cys Met Glu Asn Asn Ser Gly Trp Lys Val ser Leu 
i3 ° 135 140 

Asn Tyr Gly Glu He He Trp Thr Leu Gin Asp Thr Gin Glu He Lys 

55 150 155 160 

Gin Arg Val Val Phe Lys Tyr Ser Gin Met He Asn He Ser Asp Tyr 
165 170 17 5 

He Asn Arg Trp He Phe Val Thr lie Thr Asn Asn Arg Leu Asn Asn 
180 185 i9 0 



140: 
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Ser Lys He Tyr He Asn Gly Arg Leu He Asp Gin Lys Pro He Ser 
195 200 2 05 

Asn Leu Gly Asn He His Ala Ser Asn Asn He Met Phe Lys Leu Asd 
210 215 220 

Gly Cys Arg Asp Thr His Arg Tyr He Trp lie Lys Tyr Phe Asn Leu 
2 " 5 230 235 240 

Phe Asp Lys Glu Leu Asn Glu Lys Glu He Lys Asp Leu Tyr Asp Asn 
245 250 255 

Gin Ser Asn Ser. Gly lie Leu Lys Asp Phe Trp Gly Asp Tyr Leu Gin 
260 ^ 265 270 

Tyr Asp Lys Pro Tyr Tyr Met Leu Asn Leu Tyr Asp Pro Asn Lvs Tvr 
275 280 285 - 

~>n ^ P Val ASn ASn Val Gly Ue Ar 9 G1 V T Y r Met Leu Lys Gly 

- U 290 295 300 

Pro Arg Gly Ser Val Met Thr Thr Asn He Tyr Leu Asn Ser Ser Leu 
30b 310 315 320 

2> Tyr Arg Gly Thr Lys Phe He He Lys Lys Tyr Ala Ser Gly Asn Lvs 

32 * 330 335 ' 

Asp Asn He Val Arg Asn Asn Asp Arg Val Tyr lie Asn Val Val Val 
340 345 350 



10 



15 



30 



40 



45 



Lys Asn Lys Glu Tyr Arg Leu Ala Thr Asn Ala Ser Gin Ala Glv Val 

355 360 365 

Glu Lys lie Leu Ser Ala Leu Glu He Pro Asp Val Glv Asn Leu Ser 

370 375 380 

Gin Val Val Val Met Lys Ser Lys Asn Asp Gin Gly He Thr Asn Lvs 

" 85 39 ° 395 400 

Cys Lys Met Asn Leu Gin Asp Asn Asn Gly Asn Asp lie Gly Phe He 
405 410 415 

Gly Phe His Gin Phe Asn Asn He Ala Lys Leu Val Ala Ser Asn Trp 

42 0 425 430 

Tyr Asn Arg Gin lie Glu Arg Ser Ser Arg Thr Leu Gly Cvz Ser Trp 

43 5 440 445 * H 



Glu Phe He Pro Val Asp Asp Gly Trp Gly Glu Arg Pro Leu 
*" 450 455 460 

(2) INFORMATION FOR SEQ ID NO: 27: 

<iJ SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

WJ <ii> MOLECULE TYPE : DNA (genomic) 
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(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1 . .3 888 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO:27: 

ATG CAA TTT GTT AAT AAA CAA TTT AAT TAT AAA GAT CCT GTA AAT GGT 
Met Gin Phe Val Asn Lys Gin Phe Asn Tyr Lys A S p Pre vll A^n Gly 
1 5 10 15 

GTT GAT ATT GCT TAT ATA AAA ATT CCA AAT GTA GGA CAA ATG CAA CCA 
Val Asp He Ala Tyr lie Lys lie Pro Asn Val Gly Gin Met Gin Pro 
20 25 3 0 

GTA AAA GCT TTT AAA ATT CAT AAT AAA ATA TGG GTT ATT CCA GAA ACA 
Val Lys Ala Phe Lys He His Asn Lys lie Trp Val III Pro S Ira 
35 40 4S y 

GAT ACA TTT ACA AAT CCT GAA GAA GGA GAT TTA AAT CCA CCA CCA GAA 
Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu ftVn Pro Pro Pro G^ 

55 go 

GCA AAA CAA GTT CCA GTT TCA TAT TAT GAT TCA ACA TAT TTA ACT ACA 
Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu Ser Thr 

70 ? 5 80 

SI l^n CU> ^ T ^ ^ GGA GTT ACA *** TTA T ™ ^G 

A-p Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu Phe Glu 

85 9° 95 

AGA ATT TAT TCA ACT GAT CTT GGA AGA ATG TTG TTA ACA TCA ATA GTA 
Ai-g lie Tyr ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser lie vll 
100 105 110 

Ara ftt tT A o CA IF J GG G ? T GGA AGT ACA ATA GAT ACA GAA TTA AAA 
Arg Gly lie Pro Phe Trp Gly Gly Ser Thr lie Asp Thr Glu Leu Lvt 
115 120 125 

CTT ATT GAT ACT AAT TGT ATT AAT GTG ATA CAA CCA GAT GGT AGT TAT 
Val le Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly Ser Tvr 

I 35 140 

AGA TCA GAA GAA CTT AAT CTA GTA ATA ATA GGA CCC TCA GCT GAT ATT 
Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro s« All As P lYe 

150 155 160 

ATA CAG TTT GAA TGT AAA AGC TTT GGA CAT GAA GTT TTG AAT CTT ACG 
He G.n Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Xsn Su Thr 

16 5 " 



170 



175 



CGA AAT GGT TAT GGC TCT ACT CAA TAC ATT AGA TTT AGC CCA GAT TTT 
Arg Asn Gly Tyr Gly S er Thr Gin Tyr He Arg III Ser Pro Asp Phi 



190 



215 



220 



CTT ATA CAT GCT GGA CAT AGA TTA TAT GGA ATA GCA ATT AAT CCA AAT 
Leu lie Hi. Ala Gly Hi. Arg Leu Tyr Gly U e A^a ill £J Pro 
'- 30 235 



240 



AGG GTT TTT AAA GTA AAT ACT AAT GCC TAT TAT GAA ATG AGT GGG TTA 
Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr G^ Met Ser Gly Uu 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



ACA TTT GGT TTT GAG GAG TCA CTT GAA GTT GAT ACA AAT CCT CTT TTA 
Thr Phe Gly .Phe Glu Glu Ser Leu Glu Val Asp Thr S Pro S Su 

200 205 

Gil A1 A r?° ?** ^ G ? T ACA GAT CCA GCA GTA TTA GCA CAT GAA 67" 

Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala His Glu ' 



720 



768 



250 255 
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20 



40 



50 



60 



65 



GAA GTA AGC TTT GAG GAA CTT AGA ACA TTT GGG GGA CAT GAT GCA AAG 
Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp Ala Lvs 
260 265 270 

TTT ATA GAT AGT TTA CAG GAA AAC GAA TTT CGT CTA TAT TAT TAT AAT 
Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr Tyr Asn 
275 280 285 



CGT ACT ACT GCT TCA TTA CAG TAT ATG AAA AAT GTT TTT AAA GAG AAA 
Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys Glu Lvs 
3 °5 310 315 320 

TAT CTC CTA TCT GAA GAT ACA TCT GGA AAA TTT TCG GTA GAT AAA TTA 
Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp Lys Leu 
325 330 335 

AAA TTT GAT AAG TTA TAC AAA ATG TTA ACA GAG ATT TAC ACA GAG GAT 
Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu He Tyr Thr Glu Asd 
340 345 350 

AAT TTT GTT AAG TTT TTT AAA GTA CTT AAC AGA AAA ACA TAT TTG AAT 
Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr Leu Asn 
3 $5 360 365 



GAT TTA ATA CAA CAA TAT TAT TTA ACC TTT AAT TTT GAT AAT GAA CC^ 

Asp Leu He Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro 

500 505 510 

GAA AAT ATT TCA ATA GAA AAT CTT TCA AGT GAC ATT ATA GGC CAA TTA 

Glu Asn He Ser He Glu Asn Leu Ser Ser Asp He He Gly Gin Leu 

515 520 525 



816 



864 



AAG TTT AAA' GAT ATA GCA AGT ACA CTT AAT AAA GCT AAA TCA ATA GTA 912 
10 Lys Phe Lys Asp lie Ala Ser Thr Leu Asn Lys Ala Lys Ser He Val 

290 295 300 



960 



1008 



1056 



1104 



TTT GAT AAA GCC GTA TTT AAG ATA AAT ATA GTA CCT AAG GTA AAT TAC 1152 

M) Phe Asp Lys Ala Val Phe Lys He Asn He Val Pro Lys Val Asn Tvr 
3 ~>0 375 380 

ACA ATA TAT GAT GGA TTT AAT TTA AGA AAT ACA AAT TTA GCA GCA AAC 12 00 

Thr lie Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala Ala Asn 

-° 385 390 395 400 

TTT AAT GGT CAA AAT ACA GAA ATT AAT AAT ATG AAT TTT ACT AAA CTA 124 8 

Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr Lys Leu 

405 410 415 

AAA AAT TTT ACT GGA TTG TTT GAA TTT TAT AAG TTG CTA TGT GTA AGA 12 96 

Lys Asn Phe Thr Gly Leu Phe Glu Phe Tvr Lys Leu Leu Cys Val Arq 

420 425 * 430 

45 GGG ATA ATA ACT TCT AAA ACT AAA TCA TTA GAT AAA GGA TAC AAT AAG 134 4 

Gly He He Thr Ser Lys Thr Lys Ser Leu Asp Lys Gly Tyr Asn Lvs 
435 440 445 

GCA TTA AAT GAT TTA TGT ATC AAA GTT AAT AAT TGG GAC TTG TTT TTT 13 92 

Ala Leu Asn Asp Leu Cys He Lys Val Asn Asn Trp Asp Leu Phe Phe 
450 455 460 

AGT CCT TCA GAA GAT AAT TTT ACT AAT GAT CTA AAT AAA GGA GAA GAA 144 0 

Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu 

^ 465 470 475 480 

ATT ACA TCT GAT ACT AAT ATA GAA GCA GCA GAA GAA AAT ATT AGT TTA 14 8 8 

lie Thr Ser Asp Thr Asn He Glu Ala Ala Glu Glu Asn lie Ser Leu 

485 490 495 



1536 



1584 



GAA CTT ATG CCT AAT ATA GAA AGA TTT CCT AAT GGA AAA AAG TAT GAG 1631 
/» Glu Leu Met Pro Asn He Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu 
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530 535 



540 



795 



800 



He Pro ?S Glv Va7 l G ° J TA GAA GAT " T GAT GCT AGT CTT AAA 

Pro Tyi Gly Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys 



1680 



1728 



1776 



1824 



1920 



196 8 



Tl?, * ^ l AT ACT ATG TTC CAT TAT CTT CGT GCT CAA GAA TTT GAA 
Leu Asp Ly S Tyr Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu 

545 550 55S 560 

CAT GGT AAA TCT AGG ATT GCT TTA ACA AAT TCT GTT AAC GAA GCA TTA 
His Gly Lys Ser Arg lie Ala Leu Thr Asn Ser Val Asn Glu Ala Leu 
56 5 570 575 

7Z» o CT AGT CGT GTT TAT ACA TTT TTT TCT TC A GAC TAT GTA AAG 

Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val L^s 
580 585 590 

AAA GTT AAT AAA GCT ACG GAG GCA GCT ATG TTT TTA GGC Trr r™ ™„ 
Lys Val Asn Lys Ala Thr Glu Ala Ala Met III Uu Gly £p vll GUu 

600 6 05 

CAA TTA GTA TAT GAT TTT ACC GAT GAA ACT AGC GAA GTA AGT ACT Arc 
Gin Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser gTC vli Ser Thr Thr ^ 

615 6 20 

aS ^ ill A?f tT A ACT ATA ATT ATT CCA TAT ATA GGA CCT GCT 

h-p Lys lie Ala Asp lie Thr lie He He Pro Tyr lie Gly Pro Ala 

630 635 640 

lit A^n jjt AAT ATG TTA TAT GAT GAT TTT GTA GGT GCT TTA 

Leu Asn He Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu 

645 650 65S 

lie Phi Ser Glv All vJI rV 7° ™ GAA TTT ATA CCA GAG ATT GCA 
lie Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro Glu lie Ala 

660 665 670 

lie Pro v TA J TA ^ T ACT TTT GCA CTT GTA TCA TAT ATT GCG AAT AAG 
He Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He Ala Asn Lys 

675 680 685 

vTT f TA GTT CAA ACA ATA GAT MT GCT TTA AGT AAA AGA AAT GAA 

Val Leu Thr Val Gin Thr He Asp Asn Ala Leu Ser Lys Arg A^n Glu 

695 700 

AAA TGG GAT GAG GTC TAT AAA TAT ATA GTA ACA AAT TGG TTA GCA AAG 
Lys Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp lIu Ala JJS 

710 715 7 5o 

S IE s s; in 2S K iK is £ ™ - - s ™ 
SI IS S SK S ffi £ K "i ni liS £ HI »; - 

s; s s S! 21 ss £ si si in is if iii « 

™ E 5? Si E IE SiS S S IS 51 SI S iS lil ili »» 

77 5 7B0 

AAT AAA TTT TTG AAT CAA TGC TCT GTT TCA TAT TTA ATG AAT TCT ATP ■,<,„„ 
Asn Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met A^n ler 



2016 



2064 



2112 



2160 



2208 



2256 



2304 



2448 



810 



815 
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GAT GCA TTA TTA AAG TAT ATA TAT GAT AAT AGA GGA ACT TTA ATT GGT 24 96 

Asp Ala Leu Leu Lys Tyr lie Tyr Asp Asn Arg Gly Thr Leu He Gly 
820 825 830 

S CAA GTA GAT AGA TTA AAA GAT AAA GTT AAT AAT ACA CTT AGT ACA GAT 2544 

Gin Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp 
835 840 845 

ATA CCT TTT CAG CTT TCC AAA TAC GTA GAT AAT CAA AGA TTA TTA TCT 2 5 92 

Ile Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu Leu Ser 
850 855 860 

ACA TTT ACT GAA TAT ATT AAG AAT ATT ATT AAT ACT TCT ATA TTG AAT 
Thr Phe Thr Glu Tyr Ile Lys Asn Ile Ile Asn Thr Ser Ile Leu Asn 
13 865 870 B75 

TTA AGA TAT GAA AGT AAT CAT TTA ATA GAC TTA TCT AGG TAT GCA TCA 
Leu Arg Tyr Glu Ser Asn Hie Leu Ile Asp Leu Ser Arg Tyr Ala Ser 
885 890 895 

AAA ATA AAT ATT GGT AGT AAA GTA AAT TTT GAT CCA ATA GAT AAA AAT 
Lys Ile Asn He Gly Ser Lys Val Asn Phe Asp Pro Ile Asp Lys Asn 
900 905 910 

CAA ATT CAA TTA TTT AAT TTA GAA AGT AGT AAA ATT GAG GTA ATT TTA 
Gin Ile Gin Leu Phe Asn Leu Glu Ser Ser Lys Ile Glu Val Ile Leu 
915 920 925 



20 



40 



50 



■ • . - • * i 4.1-v nin V-rtJ-i /VMM t_*-M 

Asn Asn Ser Lys Ile Tyr Ile Asn Gly Arg Leu Ile Asp Gin Lys Pro 
^ 1025 1030 1035 1040 



60 



65 



70 



2640 



2688 



2736 



2784 



AAA AAT GCT ATT GTA TAT AAT AGT ATG TAT GAA AAT TTT AGT ACT AGC 2 8 32 

M Lys Asn Ala Ile Val Tyr Asn Ser Met Tyr Glu Asn Phe Ser Thr Ser 

930 935 940 

TTT TGG ATA AGA ATT CCT AAG TAT TTT AAC AGT ATA AGT CTA AAT AAT 2 880 

Phe Trp He Arg Ile Pro Lys Tyr Phe Asn Ser Ile Ser Leu Asn Asn 
J - 945 950 955 960 

GAA TAT ACA ATA ATA AAT TGT ATG GAA AAT AAT TCA GGA TGG AAA GTA 2 928 

Glu Tyr Thr Ile Ile Asn Cys Met Glu Asn Asn Ser Gly Trp Lvs Val 
965 970 975 

TCA CTT AAT TAT GGT GAA ATA ATC TGG ACT TTA CAG GAT ACT CAG GAA 2 976 

Ser Leu Asn Tyr Gly Glu Ile Ile Trp Thr Leu Gin Asp Thr Gin Glu 
980 985 990 

4^ ATA AAA CAA AGA GTA GTT TTT AAA TAC AGT CAA ATG ATT AAT ATA TCA 3 024 

Ile Lys Gin Arg Val Val Phe Lys Tyr Ser Gin Met Ile Asn lie Ser 
995 1000 1005 

GAT TAT ATA AAC AGA TGG ATT TTT GTA ACT ATC ACT AAT AAT AGA TTA 3 072 

Asp Tyr Ile Asn Arg Trp Ile Phe Val Thr Ile Thr Asn Asn Arg Leu 
1010 1015 1020 

AAT AAC TCT AAA ATT TAT ATA AAT GGA AGA TTA ATA GAT CAA AAA CCA 



3120 



ATT TCA AAT TTA GGT AAT ATT CAT GCT AGT AAT AAT ATA ATG TTT AAA 316 8 

He Ser Asn Leu Gly Asn Ile His Ala Ser Asn Asn Ile Met Phe Lys 
1045 1050 1055 

TTA GAT GGT TGT AGA GAT ACA CAT AGA TAT ATT TGG ATA AAA TAT TTT 3 216 

Leu Asp Gly Cys Arg Asp Thr His Arg Tyr Ile Trp Ile Lys Tyr Phe 
1060 1065 1070 

AAT CTT TTT GAT AAG GAA TTA AAT GAA AAA GAA ATC AAA GAT TTA TAT 3 264 

Asn Leu Phe Asp Lys Glu Leu Asn Glu Lys Glu Ile Lys Asp Leu Tvr 
1075 1080 1085 

GAT AAT CAA TCA AAT TCA GGT ATT TTA AAA GAC TTT TGG GGT GAT TAT 3 31"? 

Asp Asn Gin Ser Asn Ser Gly Ile Leu Lys Asp Phe Trp Gly Asp Tyr 
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1090 109 5 



1100 



TTA CAA TAT GAT AAA CCA TAC TAT ATG TTA AAT TTA TAT GAT CCA AAT 
. Leu Gin Tyr Asp Lys Pro Tyr Tyr Met Leu Asn Leu Tyr Asp Pro Asn 

5 1110 H15 1120 

AAA TAT GTC GAT GTA AAT AAT GTA GGT ATT AGA GGT TAT ATG TAT CTT 
Lys Tyr Val Asp Val Asn Asn Val Gly lie Arg Gly Tyr Met Tyr Leu 
10 1125 1130 1135 

AAA GGG CCT AGA GGT AGC GTA ATG ACT ACA AAC ATT TAT TTA AAT TCA 
Lys Gly Pro Arg Gly Ser Val Met Thr Thr Asn He Tyr Leu Asn Ser 
H40 H45 1150 

15 ACT TTG TAT AGG GGG ACA AAA TTT ATT ATA AAA AAA TAT GCT TCT GGA 

Ser Leu Tyr Arg Gly Thr Lys Phe lie lie Lys Lys Tyr Ala Ser afy 
1155 H60 1165 7 

AAT AAA GAT AAT ATT GTT AGA AAT AAT GAT CGT GTA TAT ATT AAT GTA 
Asn Lys Asp Asn He Val Arg Asn Asn Asp Arg Val Tyr He Asn Val 
li/u 1175 1180 

vll vll ?** ^ G f TAT AGG TTA GCT ACT GCA TCA CAG GCA 

^5 T?rr 75 Sn LyS G1U Tyr Arg Leu Ala Thr Asn Ala Ser Gin Ala 

185 1190 H95 1200 

Glv vll C^ f** rT A ^ GGA TTA GAA ATA CCT GAT GTA GGA AAT 

Gly Val Glu Lys lie Leu Ser Ala Leu Glu lie Pro Asp Val Gly Asn 

30 1205 1210 1215 

CTA ACT CAA GTA GTA GTA ATG AAG TCA AAA AAT GAT CAA GGA ATA ACA 
Leu Ser Gin Val Val Val Met Lys Ser Lys Asn Asp Gin Gly lie Thr 
1220 122 5 1230 

AAT AAA TGC AAA ATG AAT TTA CAA GAT AAT AAT GGG AAT GAT ATA GGC 
Asn Lys Cys Lys Met Asn Leu Gin Asp Asn Asn Gly Asn Asp lie Gly 
1235 1240 1245 

40 Phi tT A ^? A ll T CAT CAG TTT ^ ATA GCT AAA CTA GTA GCA ACT 3792 

pfn Y ^ Phe ASn A5n 116 Ala L ^ Leu val Ser 

1250 1255 12 60 

AAT TGG TAT AAT AGA CAA ATA GAA AGA TCT ACT AGG ACT TTG GGT TGC 
Asn Trp Tyr Asn Arg Gin He Glu Arg Ser Ser Arg Thr Leu Gly Cvs 

1265 1270 1275 y xiao 

Ter Trn Glu Phi y7 ° AT ° AT GGA TGG GGA GAA AGG CCA CTG 

Ser Trp Glu Phe He Pro Val Asp Asp Gly Trp Gly Glu Arg Pro Leu 

50 1285 1290 1295 

TAA 



J} 



(2) INFORMATION FOR SEQ ID NO: 28: 

^ <i> SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 1296 amino acids 
(B) TYPE: amino acid 
( D ) TOPOLOGY : linear 



60 



65 



70 



(ii) MOLECULE TYPE : protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Gin Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val Asn Glv 

5 10 15 

Val Asp He Ala Tyr lie Lys lie Pro Asn Val Gly Gin Met Gin Pro 

20 25 30 

Val Lys Ala Phe Lys lie His Asn Lys lie Trp Val lie Pro Glu Arg 
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3360 



3408 



3456 



3504 



3552 



3600 



3648 



3696 



3744 



3840 



3888 



3891 
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35 

Asp Thr Phe Thr Asn Pro 
50 



Ala Lys Gin Val 
65 

Asp Asn Glu Lys 



Arg He Tyr Ser 
100 

Arg Gly He Pro 
115 

Val He Asp Thr 
130 

Arg Ser Glu Glu 
145 

He Gin' Phe Glu 



Arg Asn 

Thr Phe 

Gly Ala 
210 

Leu He 
225 

Arg Val 
Glu Val 
Phe He 



Gly Tyr 
180 

Gly Phe 
195 



Gly Lys Phe Ala 



His Ala 



Phe Lys 



Ser Phe 
260 

Asp Ser 
275 



Lys Phe 
290 



Lys Asp He Ala 
Thr Ala 
Tyr Leu Leu Ser 



40 

Glu Glu Gly Asp Leu Asn 
55 6 o 

Pro Val Ser Tyr Tyr Asp Ser Thr 
70 75 

Asp Asn Tyr Leu Lys Gly Val Thr 
85 90 

Thr Asp Leu Gly Arg Met Leu Leu 
105 

Phe Trp Gly Gly Ser Thr He Asp 
120 

Asn Cys He Asn Val He Gin Pro 
135 140 

Leu Val He He Gly Pro 
155 

Ser Phe Gly His Glu Val 
170 

Thr Gin Tyr He Arg Phe 
185 

Ser Leu Glu Val Asp Thr 
200 

Thr Asp Pro Ala Val Thr 
215 220 

Arg Leu Tyr Gly He Ala 
235 

Thr Asn Ala Tyr Tyr Glu 
250 

Leu Arg Thr Phe Gly Gly 
265 

Glu Asn Glu Phe Arg Leu 
2B0 

Ser Thr Leu Asn Lys Ala 
295 300 



45 

Pro Pro Pro Glu 



Leu Asn 
150 

Cys Lys 
165 

Gly Ser 



Glu Glu 



Tyr Leu 

Lys Leu 

Thr Ser 
110 

Thr Glu 
125 

Asp Gly 
Ser Ala 
Leu Asn 



Ser Thr 
80 

Phe Glu 
95 

lie Val 
Leu Lys 
Ser Tyr 



Ser Pro 
190 

Asn Pro 
205 



Asp He 
160 

Leu Thr 
175 

Asp Phe 



Leu Leu 



Gly His 
230 

Val Asn 
245 

Glu Glu 

Leu Gin 



Leu Ala His Glu 



He Asn Pro Asn 
240 

Met Ser Gly Leu 
255 

His Asp Ala Lys 
270 

Tyr Tyr Tyr Asn 
285 

Lys Ser He Val 



Gly Thr 
305 



Ser Leu 
310 



Lys Phe Asp Lys 
340 

Asn Phe Val Lys 
355 

Phe Asp Lys Ala 
370 

Thr He Tyr Asp 
385 

Phe Asn Gly Gin 



Glu Asp 
325 

Leu Tyr 
Phe Phe 
Val Phe 



Gly Phe 
390 

Asn Thr 
405 



Gin Tyr Met Lys Asn Val 
315 

Thr Ser Gly Lys Phe Ser 
330 

Lys Met Leu Thr Glu lie 
345 

Lys Val Leu Asn Arg Lys 
360 

Lys He Asn He Val Pro 
^75 380 

Asn Leu Arg Asn Thr Asn 
395 

Glu He Asn Asn Met Asn 
410 



Phe Lys Glu Lys 
320 

Val Asp Lys Leu 
335 

Tyr Thr Glu Asp 
350 

Thr Tyr Leu Asn 
365 

Lys Val Asn Tyr 



Leu Ala Ala Asn 
400 

Phe Thr Lys Leu 
415 
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Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cy S Val Arg 
420 425 4^,0 a 

Gly He lie Thr Ser Lys Thr Lys Ser Leu Asp Lys Gly Tyr Asn Lys 
J * 440 445 

Ala Leu Asn Asp Leu Cys lie Lys Val Asn Asn Trp Asp Leu Phe Phe 

455 460 

Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu 
470 47 S 480 

He Thr ser Asp Thr Asn lie Glu Ala Ala Glu Glu Asn lie Ser Leu 
485 490 49s 

Asp Leu lie Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro 
500 Jos 510 

Glu Asn lie Ser lie Glu Asn Leu Ser Ser Asp He lie Gly Gin Leu 

520 525 

Glu Leu Met Pro Asn He Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu 

535 540 

Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu 

550 555 560 

His Gly Lys Ser Arg He Ala Leu Thr Asn Ser Val Asn Glu Ala Leu 
565 570 5V5 

Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tvr Val Lys 
Jd0 585 590 

Lys val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu 
" 6 00 605 

Gin Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr 

615 620 

Asp Lys lie Ala Asp lie Thr lie lie He Pro Tyr lie Gly Pro Ala 

630 635 6 4 0 

Leu Asn lie Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu 
645 650 6S5 

He Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro Glu He Ala 
660 665 670 

He Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He Ala Asn Lys 
675 680 685 

Val Leu Thr Val Gin Thr lie Asp Asn Ala Leu Ser Lys Arg Asn Glu 

695 7Q 0 

Lys Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp Leu Ala Lys 
710 715 7 ^o 

Val Asn Thr Gin He Asp Leu lie Arg Lys Lys Met Lys Glu Ala Leu 
725 730 735 

Glu Asn Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr Gin Tyr Asn 

? 45 7 50 

Gin Tyr Thr Glu Glu Glu Lys Asn Asn He Asn Phe Asn He Asp Asp 

760 765 

Leu Ser Ser Lys Leu Asn Glu Ser lie Asn Lys Ala Met He Asn lie 

775 780 
Asn Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met Asn Ser Met 
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785 790 



7 95 800 



15 



He Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys 
5 805 810 sis 

Asp Ala Leu Leu Lys Tyr He Tyr Asp Asn Arg Gly Thr Leu He Glv 
820 825 8 3o 

Gin Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asd 
iU 835 840 845 

lie Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu Leu Ser 
850 855 860 

Thr Phe Thr Glu Tyr He Lys Asn lie He Asn Thr Ser He Leu Asn 
865 870 875 880 

Leu Arg Tyr Glu Ser Asn His Leu lie Asp Leu Ser Arg Tyr Ala Ser 
20 885 890 895 

Lys He Asn He Gly Ser Lys Val Asn Phe Asp Pro He Asp Lys Asn 
900 905 9X0 

Gin He Gin Leu Phe Asn Leu Glu Ser Ser Lys He Glu Val He Leu 
915 920 925 

Lys Asn Ala He Val Tyr Asn Ser Met Tyr Glu Asn Phe Ser Thr Ser 
930 935 940 

30 Phe Trp lie Arg He Pro Lys Tyr Phe Asn Ser He Ser Leu Asn Asn 

945 950 955 960 

Glu Tyr Thr lie He Asn Cys Met Glu Asn Asn Ser Gly Trp Lys Val 
- S 965 970 975 

Ser Leu Asn Tyr Gly Glu He He Trp Thr Leu Gin Asp Thr Gin Glu 
980 985 990 

He Lys Gin Arg Val Val Phe Lys Tyr Ser Gin Met lie Asn He Ser 
4U "5 1000 1005 



45 



50 



Asp Tyr He Asn Arg Trp He Phe Val Thr He Thr Asn Asn Arg Leu 
1010 1015 1020 

Asn Asn Ser Lys He Tyr lie Asn Gly Arg Leu He Asp Gin Lys Pro 
1025 1030 1035 1040 

He Ser Asn Leu Gly Asn He His Ala Ser Asn Asn lie Met Phe Lys 
1045 1050 1055 

Leu Asp Gly Cys Arg Asp Thr His Arg Tyr He Trp lie Lys Tyr Phe 
1060 1065 1070 

Asn Leu Phe Asp Lys Glu Leu Asn Glu Lys Glu He Lys Asp Leu Tyr 
1075 1080 loss 

Asp Asn Gin Ser Asn Ser Gly He Leu Lys Asp Phe Trp Gly Asp Tvr 
10 90 1095 iioo 

Leu Gin Tyr Asp Lys Pro Tyr Tyr Met Leu Asn Leu Tyr Asp Pro Asn 
H05 • mo ins use 

Lys Tyr Val Asp Val Asn Asn Val Gly He Arg Gly Tvr Met Tyr Leu 
H25 H30 ' 1135 

Lys Gly Pro Arg Gly Ser Val Met Thr Thr Asn He Tyr Leu Asn Ser 
H40 H45 1150 

Ser Leu Tyr Arg Gly Thr Lys Phe He He Lys Lys Tyr Ala Ser Gly 
1155 1160 lies 



60 



65 
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Asn Lys Asp Asn lie Val Arg Asn Asn Asp Arg Val Tyr He Asn Val 
1170 H7S uso 

Val Val Lys Asn Lys Glu Tyr Arg Leu Ala Thr Asn Ala Ser Gin Ala 
1185 1190 U95 120 0 

Gly Val Glu Lys He Leu Ser Ala Leu Glu lie Pro Asp Val Gly Asn 
1205 12 io 1215 

Leu Ser Gin Val Val Val Met Lys Ser Lys Asn Asp Gin Gly lie Thr 
1220 1225 12 30 

Asn Lys Cys Lys Met Asn Leu Gin Asp Asn Asn Gly Asn Asp lie Gly 
1235 124 0 1245 ' 

Phe "a Gly Phe His Gin Phe Asn Asn He Ala Lys Leu Val Ala Ser 

j u 1255 - 



1260 



Asnjrp Tyr Asn Arg Gln^Ile Glu Arg Ser Ser Arg Thr Leu Gly Cys 

Ser Trp Glu Phe lie Pro Val Asp Asp Gly Tr P Gly Glu Arg Pro Leu 
1285 1290 1295 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 
(A} LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{D> TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 
(A) DESCRIPTION: /desc = " DNA " 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 29: 

CGCCATGGCT AGATTATTAT CTACATTTAC 

(2) INFORMATION FOR SEQ ID NO: 30: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

tii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

GCAAGCTTCT TGACAGACTC ATGTAG 

(2) INFORMATION FOR SEQ ID NO: 31; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
'(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 
TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACCATG GGCCATCATC 1 
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ATCATCATCA 


. TCATCATCAT 


CACAGCAGCG 


GCCATATCGA 


AGGTCGTCAT 


ATGGCTAGCA 


inn 




TGGCTAGATT 


ATTATCTACA 


TTTACTGAAT 


ATATTAAGAA 


TATTATTAAT 


ACTTCTATAT 


"5 a n 


5 


TGAATTTAAG 


ATATGAAAGT 


AATCATTTAA TAGACTTATC 


TAGGTATGCA 


TCAAAAATAA 


300 




ATATTGGTAG 


TAAAGTAAAT 


TTTGATCCAA 


TAGATAAAAA 


TCAAATTCAA 


TTATTTAATT 


Jo 0 


1 0 

i \i 


TAGAAAGTAG 


TAAAATTGAG 


GTAATTTTAA 


AAAATGCTAT 


TGTATATAAT 


AGTATriTLTP 


42 0 




AAAATTTTAG 


TACTAGCTTT 


TGGATAAGAA 


TTCCTAAGTA 


TTTTAACAGT 


AT AAGTCT A A 


4 80 




ATAATGAATA 


TACAATAATA 


AATTGTATGG 


AAAATAATTC 


AGGATGGAAA 




540 


15 


ATTATGGTGA 


AATAATCTGG 


ACTTTACAGG 


ATACTCAGGA 


AATAAAACAA 


"UrtO 1 rtVJ ill 


600 




TTAAATACAG 


TCAAATGATT 


AATATATCAG 


ATTATATAAA 


CAGATGGATT 




660 


*>n 
_u 


TCACTAATAA 


TAGATTAAAT 


AACTCTAAAA 


TTTATATAAA 


TGGAAGATTA 


m * MVjM I v» AAA 


720 




AACCAATTTC 


AAATTTAGGT 


AATATTCATG 


CTAGTAATAA 


TATAATGTTT 




780 


25 


GTTGTAGAGA 


TACACATAGA 


TATATTTGGA 


TAAAATATTT 


TAATCTTTTT 


G AT A An A a t 


84 0 


TAAATGAAAA 


AGAAATCAAA 


GATTTATATG 


ATAATCAATC 


AAATTCAGGT 


Ml 1 il AAAAG 


900 




ACTTTTGGGG 


TGATTATTTA 


CAATATGATA 


AAC CAT ACTA 


TATGTTAAAT 


i. inirtl oM 1 \, 


960 




CAAATAAATA 


TGTCGATGTA 


AATAATGTAG 


GTATTAGAGG 


TTATATG TAT 




1020 




CTAGAGGTAG 


CGTAATGACT 


ACAAACATTT 


ATTTAAATTC 


AAGTTTGTAT 




1080 




AATTTATTAT 


AAAAAAATAT 


GCTTCTGGAA 


ATAAAGATAA 


TATTGTTAGA 


AATAATflATr 


114 0 


35 


GTGTATATAT 


TAATGTAGTA 


GTTAAAAATA 


AAGAATATAG 


GTTAGCTACT 




12 00 




AGGCAGGCGT 


AGAAAAAATA 


CTAAGTGCAT 


TAGAAATACC 


TGATGTAGGA 


HM 1 l_ i AAQj J L 


126 0 


40 


AAGTAGTAGT 


AATGAAGTCA 


AAAAATGATC 


AAGGAATAAC 




AAAATGAATT 


1320 




TACAAGATAA 


TAATGGGAAT 


GATATAGGCT 


TTATAGGATT 


TCATCAGTTT 


AATAATATAG 


1380 


45 


CTAAACTAGT 


AGCAAGTAAT 


TGGTATAATA 


GACAAATAGA 


AAGATCTAGT 


AGGACTTTGG 


1440 


GTTGCTCATG 


GGAATTTATT 


CCTGTAGATG 


ATGGATGGGG 


AGAAAGGCCA 


CTGTAATTAA 


1500 




TCTCAAACTA 


CATGAGTCTG 


TCAAGAAGCT 


TGCGGCCGCA 


CTCGAG 




1546 



(2) INFORMATION FOR SEQ ID NO: 32: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
<D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 32: 

Met His His His His His His Met Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION : /desc = " DNA " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 

TATGCATCAC CATCACCATC A 

(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = " DNA " 

(xi} SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

CATGTGATGG TGATGGTGAT GCA 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



20 



30 



35 



40 



35 



60 



(ii) MOLECULE TYPE : other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(ix) FEATURE : 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..1335 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO:35: 

ATG CAT CAC CAT CAC CAT CAC ATG GCT CGT CTG CTG TCT ACC TTC ACT 
Met His His His His His His Met Ala Arg Leu Leu Ser Thr Phe Thr 
15 10 15 



GAA TCC AAT CAC CTG ATC GAC CTG TCT CGC TAC GCT TCC AAA ATC AAC 
Glu Ser Asn His Leu He Asp Leu Ser Arg Tyr Ala Ser Lys He Asn 
^ U 35 4 o 45 



ATC GGT TCT AAA GTT AAC TTC GAT CCG ATC GAC AAG AAT CAG ATC CAG 
lie Gly Ser Lys Val Asn Phe Asp Pro lie Asp Lys Asn Gin He Gin 
50 55 60 

CTG TTC AAT CTG GAA TCT TCC AAA ATC GAA GTT ATC CTG AAG AAT GCT 
Leu Phe Asn Leu Glu Ser Ser Lys lie Glu Val lie Leu Lys Asn Ala 
65 70 75 eo 

ATC GTA TAC AAC TCT ATG TAC GAA AAC TTC TCC ACC TCC TTC TGG ATC 
He Val Tyr Asn Ser Met Tyr Glu Asn Phe Ser Thr Ser Phe Trp He 
85 90 95 



70 



ATC ATC AAC TGC ATG GAA AAC AAT TCT GGT TGG AAA GTA TCT CTG AAC 
He He Asn Cys Met Glu Asn Asn Ser Gly Trp Lys Val Ser Leu Asn 
115 120 125 



23 



46 



GAA TAC ATC AAG AAC ATC ATC AAT ACC TCC ATC CTG AAC CTG CGC TAC 9 6 

Glu Tyr He Lys Asn He He Asn Thr Ser He Leu Asn Leu Arq Tyr 
20 25 30 



144 



192 



240 



288 



AS I tT o° G *** TAC TTC TCC ATC TCT CTG GAA TAC ACC 336 

to Arg He Pro Lys Tyr Phe Asn Ser He Ser Leu Asn Asn Glu Tyr Thr 
100 105 no 



384 
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Kr oil nf- rT° I GG i CT CTG CAG GAC ACT CAG GAA ft TC AAA CAG 

I1S Trp Thr L6U Gln As P Thr Gln Glu He Lys Gin 
135 140 

5 CGT GTT GTA TTC AAA TAC TCT CAG ATG ATC AAC ATC TCT GAC TAC ATC 

Arg V.1 Val Phe Lys Tyr Ser Gln Met lie Asn lie Ser Asp Tyr lie 

150 155 160 

10 T T * GC l GG A T C TTC GTT ACC ATC ACC AAT CGT CTG AAT AAC TCC 

10 Asn Arg Trp He Phe Val Thr He Thr Asn Asn Arg Leu Asn aSS sir 

165 170 175 

AAA ATC TAC ATC AAC GGC CGT CTG ATC GAC CAG AAA CCG ATC TCC AAT 
Lys He Tyr lie Asn Gly Arg Leu He Asp Gln Lys Pro lie sir A^n 
180 165 19Q 

llu ril A^n n£ G r ^ ^ ATC ATG TTC ^ CTG ^ GGT 

Leu My Asn lie His Ala Ser Asn Asn He Met Phe Lys Leu Asp Gly 

2() 195 200 205 

TGT CGT GAC ACT CAC CGC TAC ATC TGG ATC AAA TAC TTC AAT CTG TTC 
Cys Arg Asp Thr His Arg Tyr He Trp He Lys Tyr Phe Sn 25 III 

215 220 

25 GAC AAA GAA CTG AAC GAA AAA GAA ATC AAA GAC CTG TAC GAC AAC CAG 

Asp Lys Glu Leu Asn Glu Lys Glu lie Lys Asp Leu Tyr Asp Gln 

230 235 240 

TCC AAT TCT GGT ATC CTG AAA GAC TTC TGG GGT GAC TAC CTC car Tar 
•'0 Ser Asn Ser Gly He Leu Lys Asp Phe Trp Gly Asp Tyr Su Gln Tyf 

245 250 255 



65 



A A n f** l AC Z AC ATG CTG A* 7 CTG TAC GAT CCG AAC AAA TAC GTT 

Asp Lys Pro Tyr Tyr Met Leu Asn Leu Tyr Asp Pro Asn Lys Tyr Val 
'60 265 270 



380 



AAA ATG AAT CTG CAG GAC AAC AAT GGT AAC GAT ATC GGT TTC ATC GGT 
Lys Met Asn Leu Gln Asp Asn Asn Gly Asn Asp He Gly Phe He Gly 
385 390 395 400 



432 



480 



528 



S76 



624 



6 72 



720 



816 



864 



912 



960 



Aon SI? a AAT GTA GGT ATC CGC GGT TAC ATG TAG CTG AAA GGT CCG 
Asp val Asn Asn Val Gly He Arg Gly Tyr Met Tyr Leu Lys Gly Pro 

4(, ' 7S 280 285 

CGT GGT TCT GTT ATG ACT ACC AAC ATC TAC CTG AAC TCT TCC CTG TAC 
Arg Gly Ser Val Met Thr Thr Asn lie Tyr Leu Asn Ser Ser Leu Tyr 
-90 295 30Q 

*™ r GT A .k C T I C ATC ATC ™ C ^ TA C GCG TCT GGT AAC AAG GAC 

Aig Gly Thr Lys Phe He He Lys Lys Tyr Ala Ser Gly Asn Lys Asp 

3i0 315 320 

50 A T° u T T CGC AAC AA T GAT CGT GTA TA C ATC AAT GTT GTA GTT AAG 1008 

5U Asn He Val Arg Asn Asn Asp Arg Val Tyr He Asn Val Val Val Lys 

325 330 335 

AAC AAA GAA TAC CGT CTG GCT ACC AAT GCT TCT CAG GCT GGT GTA GAA 
Asn Lys Glu Tyr Arg Leu Ala Thr Asn Ala Ser Gln Ala Gly Val Glu 
340 345 35 £ 

AAG ATC TTG TCT GCT CTG GAA ATC CCG GAC GTT GGT AAT CTG TCT CAG 
Lys He Leu Ser Ala Leu Glu He Pro Asp Val Gly Asn Leu Ser Gln 
(,() 355 360 3 65 

vI A SIT ft ^ fAA TCC AAG AAC GAC CAG GGT ATC ACT AAC AAA TGC 
Val yal Val Met Lys Ser Lys Asn Asp Gln Gly He Thr Asn Lys Cys 



1056 



1104 



1152 



1200 



70 11° CAC £ AG 11° AA ° AAT ATC GCT AAA CTG GTT GCT TCC AAC TGG TAC 124 8 

/» Phe His Gln Phe Asn Asn He Ala Lys Leu Val Ala Ser Asn Trp Tyr 
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10 



20 



30 



j5 



40 



45 



50 



55 



f>() 



65 



70 











405 










410 






AAT 


CGT 


CAG 


ATC 


GAA 


CGT 


TCC 


TCT 


CGC 


ACT 


CTG 


GGT TGC 


Asn 


Arg 


Gin 


He 


Glu 


Arg 


Ser 


Ser 


Arg 


Thr 


Leu 


Gly Cys 








420 










425 






TTC 


ATC 


CCG 


GTT 


GAT 


GAC 


GGT 


TGG 


GGT 


GAA 


CGT 


CCG CTG 


Phe 


lie 


Pro 


Val 


Asp 


Asp 


Gly 


Trp 


Gly 


Glu 


Arg 


Pro Leu 






435 










440 






445 



415 



430 



AAGCTT 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 445 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Hi) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Met His His His His His His Met Ala Arg Leu Leu Ser Thr Phe Thr 
1 5 io 15 

Glu Tyr lie Lys Asn lie lie Asn Thr Ser lie Leu Asn Leu Arq r V r 
20 25 30 

Glu Ser Asn His Leu lie Asp Leu Ser Arg Tyr Ala Ser Lys He Asn 
35 40 45 

He Gly Ser Lys Val Asn Phe Asp Pro He Asp Lys Asn Gin He Gin 
50 55 60 

Leu Phe Asn Leu Glu Ser Ser Lys He Glu Val He Leu Lys Asn Ala 
65 7 ° 75 80 

He Val Tyr Asn Ser Met Tyr Glu Asn Phe Ser Thr Ser Phe Trp He 
8S 90 95 

Arg He Pro Lys Tyr Phe Asn Ser He Ser Leu Asn Asn Glu Tyr Thr 
100 105 no 

He lie Asn Cys Met Glu Asn Asn Ser Gly Trp Lys Val Ser Leu Asn 
115 120 125 

Tyr Gly Glu He He Trp Thr Leu Gin Asp Thr Gin Glu He Lys Gin 
130 135 140 

Arg Val Val Phe Lys Tyr Ser Gin Met He Asn He Ser Asp Tyr He 
145 150 155 160 

Asn Arg Trp He Phe Val Thr He Thr Asn Asn Arg Leu Asn Asn Ser 
165 170 i7 5 

Lys He Tyr He Asn Gly Arg Leu He Asp Gin Lys Pro He Ser Asn 
180 185 190 

Leu Gly Asn lie His Ala Ser Asn Asn lie Met Phe Lys Leu Asp Gly 
19 5 200 205 

Cys Arg Asp Thr His Arg Tyr lie Trp He Lys Tyr Phe Asn Leu Phe 
210 215 220 

Asp Lys Glu Leu Asn Glu Lys Glu He Lys Asp Leu Tyr Asp Asn Gin 
5 230 235 240 

Ser Asn Ser Gly lie Leu Lys Asp Phe Trp Gly Asp Tyr Leu Gin Tyr 
24 5 2S0 255 



1296 



1345 



1351 
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Asp Lys Pro Tyr Tyr Met Leu Asn Leu Tyr Asp Pro Asn Lys Tyr Val 
260 265 270 

Asp Val Asn Asn Val Gly He Arg Gly Tyr Met Tyr Leu Lys Gly Pro 
275 280 285 

Arg Gly Ser Val Met Thr Thr Asn He Tyr Leu Asn Ser Ser Leu Tyr 
- 90 295 300 r 

Arg Gly Thr Lys Phe lie He Lys Lys Tyr Ala Ser Gly Asn Lys Asp 

310 315 320 

Asn He Val Arg Asn Asn Asp Arg Val Tyr He Asn Val Val Val Lys 
325 33 0 335 

Asn Lys Glu Tyr Arg Leu Ala Thr Asn Ala Ser Gin Ala Gly Val Glu 
340 345 350 

Lys lie Leu Ser Ala Leu Glu He Pro Asp Val Gly Asn Leu Ser Gin 
J55 360 365 

Val Val Val Met Lys Ser Lys Asn Asp Gin Gly lie Thr Asn Lys Cys 

I//s Met Asn Leu Gin Asp Asn Aon Gly Asn Asp lie Gly Phe lie Glv 
* 390 395 400 

Phe His Gin Phe Asn Asn lie Ala Lys Leu Val Ala Ser Asn Trp Tyr 
405 410 415 

Asn Arg Gin lie Glu Arg Ser Ser Arg Thr Leu Gly Cys Sor Trp Glu 

425 430 

Phe lie Pro Val Asp Asp Gly Trp Gly Glu Arg Pro Leu 
435 440 445 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION : /desc = " DNA M 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO : 3 7 : 

CGCATATGAA TATTCGTCCA TTGCATG 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 
* (A) DESCRIPTION: /desc = " DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

GGAAGCTTGC AGGGCAATTA CATCATG 

(2) INFORMATION FOR SEQ ID NO: 39: 

Ii) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 3876 base pairs 
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10 



40 



65 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TO POLOG Y : linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. . 3873 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

ATG CCA GTT ACA ATA AAT AAT TTT AAT TAT AAT GAT CCT ATT GAT AAT 
Met Pro Val Thr lie Asn Asn Phe Asn Tyr Asn Asp Pro He Asp IE 

GAC AAT ATT ATT ATG ATG GAA CCT CCA TTT GCA AGG GGT ACG GGG AGA 
Asp Asn lie lie Met Met Glu Pro Pro Phe Ala Arg Gly Thr Gly Arg 
20 2 5 30 

^vr ?? T ll T J** A T° ACA GAT CGT ATT TGG ATA "A CCC GAA 

Tyr Tyr Lys Ala Phe Lys lie Thr Asp Arg He Trp lie He Pro Glu 

J5 40 45 

AGA TAT ACT TTT GGA TAT AAA CCT GAG GAT TTT AAT AAA AGT TCC rrr 
Arg Tyr Thr Phe Gly Tyr Lys Pro Glu Asp Phe £n Ser Ser Gly 

55 6o 



Ihl r? A n CT S? G CCA GTT ™ ^ GAA AAT GAG ACT ATA GAT ATA GGT 
Phe Gly Pro Gly Pro Val Leu Asn Glu Asn Glu Thr He Asp lie Gly 
!65 170 175 * 

ATA CAA AAT CAT TTT GCA TCA AGG GAA GGC TTT GGG GGT ATA ATG CAA 
lie Gin Asn His Phe Ala Ser Arg Glu Gly Phe Gly Gly lie 51? Gin 

60 185 iso 

ATG AAA TTT TGT CCA GAA TAT GTA AGC GTA TTT AAT AAT GTT CAA GAA 
Met Lys Phe Cys Pro Glu Tyr Val Ser Val Phe Asn Asn Val Gin Glu 
195 200 205 



AAC AAA GGC GCA AGT ATA TTT AAT AGA CGT GGA TAT TTT TCA GAT CCA 
Asn Lys Gly Ala Ser lie Phe Asn Arg Arg Gly Tyr Phe Ser Asp Pro 

** 1U 215 220 

GCC TTG ATA TTA ATG CAT GAA CTT ATA CAT GTT TTG CAT GGA TTA TAT 
Ala Leu lie Leu Met His Glu Leu He His Val Leu His Gly Leu Tyr 
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48 



96 



144 



191 



M) nl ll T T T n GA GAT GTT TGT GAA TAT TAT GAT CCA GAT TAG TTA AAT 24 0 

lie Phe Asn Arg Asp Val Cys Glu Tyr Tyr Asp Pro Asp Tyr Leu Asn 

70 ? 5 80 



ACC AAT GAT AAA AAG AAT ATA TTT TTC CAA ACA TTG ATC AAG TTA TTT 
Thr Asn Asp Lys Lys Asn He Phe Phe Gin Thr Leu lie Lys Leu Phe 
85 90 95 

AAT AGA ATC AAA TCA AAA CCA TTG GGT GAA AAG TTA TTA GAG ATG ATT 
Asn Arg He Lys Ser Lys Pro Leu Gly Glu Lvs Leu Leu Glu Met He 

n * r? T ?T A n° T TAT CTT GGA GAT AGA CGT GTT CCA "C GAA GAG 

ile Asn Gly He Pro Tyr Leu Gly Asp Arg Arg Val Pro Leu Glu Glu 

115 120 12 5 

4? TTT AAC ACA AAC ATT GCT AGT GTA ACT GTT AAT AAA TTA ATT AGT AAT 

Phe Asn Thr Asn lie Ala Ser Val Thr Val Asn Lys Leu Tie Ser JiJ 
130 135 140 

SO Pro GlC ctu" vl? r? G ? GA ?** *** GGT ATT TTC GCA ^ T TTA ATA ATA 4 80 

Pro Gly Glu Val Glu Arg Lys Lys Gly He Phe Ala Asn Leu He Ile 

145 150 155 160 



288 



336 



364 



432 



528 



576 



624 



672 
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225 230 235 2 4o 

GGC ATT AAA GTA GAT GAT TTA CCA ATT GTA CCA AAT GAA AAA AAA TTT 
. Gly lie Lys Val Asp Asp Leu Pro He Val Pro Asn Glu Lys Lvs Phe 

245 250 255 

^ t T ? TCT ACA GAT ACT ATA CAG GCA GAA Gft A CTA TAT ACA TTT 

Phe Met Gin Ser Thr Asp Thr lie Gin Ala Glu Glu Leu Tyr Thr Pne 
|(i 260 265 270 

r?v C^t CCC AGC ATC ATA TCT CCT TCT ACA GAT AAA ACT ATC 

Gly Gly Gin Asp Pro Ser He lie Ser Pro Ser Thr Asp Lys aer lie 

275 280 285 

15 TAT GAT AAA GTT TTG CAA AAT TTT AGG GGG ATA GTT GAT AGA CTT AAC 

lyr Asp Lys Val Leu Gin Asn Phe Arg Gly lie Val Asp Arg Leu j£n 
° 29S 300 

AAG GTT TTA GTT TGC ATA TCA GAT CCT AAC ATT AAC ATT AAT ATA TAT 

-0 Lys val Leu Val Cys He Ser Asp Pro Asn lie Asn lie Asn He Jvr 

305 310 315 

AAA AAT AAA TTT AAA GAT AAA TAT AAA TTC GTT GAA GAT TCT GAA GGA 

Lys Asn Lys Phe Lys Asp Lys Tyr Lys Phe Val Glu Asp Ser Glu 
32 5 330 335 

AAA TAT ACT ATA GAT GTA GAA ACT TTC AAT AAA TTA TAT AAA AGC TTA 
Lys Tyr Her lie Asp Val Glu Ser Phe Asn Lys Leu Tyr Lys Ser Leu 
;,() 340 345 3 5 o 

ATG TTA GGT TTT ACA GAA ATT AAT ATA GCA GAA AAT TAT AAA ATA AAA 
Met Leu Gly Phe Thr Glu He Asn lie Ala Glu Asn Tyr Lvs lie L^ 
355 3 6 o 365 - y 

Thr Ara A^I V" ^ T AGT GAT TCC TTA CGA CCA GTA ATA AAA 

Thr Aig Ala Ser Tyr Phe Ser Asp Ser Leu Pro Pro Val Lys He Lvs 

370 375 3B0 > 



470 475 480 

TAT ATA GGA AAT GAC TTT CCT ATA AAT GAA TTA ATT TTA GAT ACT GAT 

Tyr He Gly Asn Asp Phe Pro He Asn Glu Leu He Leu Asp Thr Asd 

" 485 490 ^ 495 H 

TTA ATA AGT AAA ATA GAA TTA CCA AGT GAA AAT ACA GAA TCA CTT ACT 

Leu He Ser Lys He Glu Leu Pro Ser Glu Asn Thr Glu Ser Leu Thr 

70 500 5 °5 510 



768 



816 



864 



9i; 



960 



1008 



1056 



1104 



1152 



40 I J 7 " GAT MT GAA ATC TAT ACT ATA GAG GAA GGG T TT AAT ATA 1200 

W Asn Leu Leu Asp Asn Glu He Tyr Thr lie Glu Glu Gly Phe Asn He 

385 390 395 4Q0 

TCT GAT AAA AAT ATG GGA AAA GAA TAT AGG GGT CAG AAT AAA GCT ATA 
oer Asp Lys Asn Met Gly Lys Glu Tyr Arg Gly Gin Asn Lys Ala He 
405 410 415 

AAT AAA CAA GCT TAT GAA GAA ATC AGC AAG GAG CAT TTG GCT GTA TAT 
Asn Lys Gin Ala Tyr Glu Glu He Ser Lys Glu His Leu Ala Val Tyr 
5() 420 425 430 

AAG ATA CAA ATG TGT AAA AGT GTT AAA GTT CCA GGA ATA TGT ATT GAT 
Lys He Gin Met Cys Lys Ser Val Lys Val Pro Gly He Cys He Asd 
435 44 0 445 

" 3 GTC GAT AAT GAA AAT TTG TTC TTT ATA GCT GAT AAA AAT AGT TTT TCA 

Val Asp Asn Glu Asn Leu Phe Phe He Ala Asp Lys Asn Ser Phe Ser 

450 455 46{) 

«) A A o A A n 73* T T f** ^ GAA AGA GTA GAA TAT AAT ACA CAG AAT AAT 144 0 

Asp Asp Leu Ser Lys Asn Glu Arg Val Glu Tyr Asn Thr Gin Asn Asn 

4 D J A ~7 f\ 



1248 



1296 



1344 



1392 



1488 



1536 
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4(1 



45 



60 



65 



GAT TTT AAT GTA GAT GTT CCA GTA TAT GAA AAA CAA CCC GCT ATA AAA 
Asp Phe Asn Val Asp val Pro Val Tyr Glu Lys Gin Pro Ala He Lvs 
515 520 525 

AAA GTT TTT ACA GAT GAA AAT ACC ATC TTT CAA TAT TTA TAC TCT CAG 
Lys Val Phe Thr Asp Glu Asn Thr He Phe Gin Tyr Leu Tyr Ser Gin 
530 535 54Q 



650 655 

GAA CTT TTA ATA CCT GTA GTT GGA GTC TTT TTA TTA GAA TCA TAT ATT 
Glu Leu Leu lie Pro Val Val Gly Val Phe Leu Leu Glu Ser Tyr lie 
660 66 5 67Q 

GAC AAT AAA AAT AAA ATT ATT AAA ACA ATA GAT AAT GCT TTA ACT AAA 
Asp Asn Lys Asn Lys lie lie Lys Thr lie Asp Asn Ala Leu Thr Lvs 
675 680 685 

AGA GTG GAA AAA TGG ATT GAT ATG TAC GGA TTA ATA GTA GCG CAA TGG 

9 «i LyS Trp US ASP Met T Y r Gl y Leu Ile v *l Ala Gin Trp 

690 695 700 



AAA TAT AAT ATA TAT TCT GAA GAG GAA AAG TCA AAT ATT AAC ATC AAT 
Lys Tyr Asn Ile Tyr Ser Glu Glu Glu Lys Ser Asn Ile Asn Ile Asn 
740 745 750 

TTT AAT GAT ATA AAT TCT AAA CTT AAT GAT GGT ATT AAC CAA GCT ATG 
Phe Asn Asp lie Asn Ser Lys Leu Asn Asp Gly lie Asn Gin Ala Met 
755 760 765 

GAT AAT ATA AAT GAT TTT ATA AAT GAA TGT TCT GTA TCA TAT TTA ATG 
Asp Asn Ile Asn Asp Phe Ile Asn Glu Cys Ser Val Ser Tyr Leu Met 
770 775 7 80 



1584 



1632 



ACA TTT CCT CTA AAT ATA AGA GAT ATA ACT TTA ACA TCT TCA TTT GAT 168 0 
Thr Phe Pro Leu Asn Ile Arg Asp He Ser Leu Thr Ser Ser Phe Asp 
545 550 555 sec 



GAT GCA TTA TTA GTT TCT AGC AAA GTT TAT TCA TTT TTT TCT ATG GAT 
Asp Ala Leu Leu Val Ser Ser Lys Val Tyr Ser Phe Phe Ser Met Asd 
565 570 5?5 

TAT ATT AAA ACT GCT AAT AAA GTA GTA GAA GCA GGA TTA TTT GCA GGT 
Tyr lie Lys Thr Ala Asn Lys Val Val Glu Ala Gly Leu Phe Ala Glv 

20 580 5fi 5 590 

TGG GTG AAA CAG ATA GTA GAT GAT TTT GTA ATC GAA GCT AAT AAA AGC 
Trp Val Lys Gin lie Val Asp Asp Phe Val lie Glu Ala Asn Ser 
595 600 605 

21 AGT ACT ATG GAT AAA ATT GCA GAT ATA TCT CTA ATT GTT CCT TAT ATA 
Ser Thr Met Asp Lys Ile Ala Asp Ile Ser Leu Ile Val Pro Tyr lie 

610 615 620 

10 r? A 7™ ^ T ** T GTA GGA GAT GAA ACA GCT AAA GGA AAT TTT GAA 1920 

M Gly Leu Ala Leu Asn Val Gly Asp Glu Thr Ala Lys Gly Asn Phe Glu 

b ^ b 630 635 640 

AGT GCT TTT GAG ATT GCA GGA TCC AGT ATT TTA CTA GAA TTT ATA CCA 
3 _ Ser Ala Phe Glu lie Ala Gly Ser Ser lie Leu Leu Glu Phe lie Pro 



1728 



1776 



1824 



1872 



1968 



>016 



2064 



2112 



CTC TCA ACA GTT AAT ACT CAA TTT TAT ACA ATA AAA GAG GGA ATG TAT 2160 
>0 Leu Ser Thr Val Asn Thr Gin Phe Tyr Thr Ile Lys Glu Gly Met Tyr 

705 710 715 720 

AAG GCT TTA AAT TAT CAA GCA CAA GCA TTG GAA GAA ATA ATA AAA TAC 
Lys Ala Leu Asn Tyr Gin Ala Gin Ala Leu Glu Glu Ile Ile Lys Tyr 
' J 7 25 730 735 



2208 



2256 



2304 



2352 



70 fAA ATG ATT CCA TTA GCT GTA AAA AAA TTA CTA GAC TTT GAT AAT 2400 

70 Lys Lys Met lie Pro Leu Ala Val Lys Lys Leu Leu Asp Phe Asp 
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785 790 800 

Thr Su ^ ^ f T 7 A J TA AAT TAT ATA GAT GAA AAT AAA TTA TAT 244 8 
. Thr Leu Lys Lys Asn Leu Leu Asn Tyr lie Asp Glu Asn Lys Leu Tyr 

805 810 815 

TTA ATT GGA AGT GTA GAA GAT GAA AAA TCA AAA GTA GAT AAA TAC TTf 
Leu He Gly Ser Val Glu Asp Glu Lys Ser Lys Val ts P J£ Tyr leu 
|() 820 825 830 

AAA ACC ATT ATA CCA TTT GAT CTT TCA ACG TAT TCT AAT ATT GAA ATA 
Lys Thr lie lie Pro Phe Asp Leu Ser Thr Tyr Ser ill Qlu He 

840 845 



65 



CTA ATA AAA ATA TTT AAT AAA TAT AAT AGC GAA ATT TTA AAT AAT ATT 
Leu lie Lys He Phe Asn Lys Tyr Asn Ser Glu lie Leu Asn ill 

855 860 



Asn «ec Asp lie Lys Asp lie Oly g7u V^ He Val Zn Tly oTu lie 
1025 1030 1035 1040 

Thr Dh T ?** 7 TA GAT GGT GAT GTA GAT AGA ACA CAA TTT ATT TGG ATG 

Thr Phe Lys Leu Asp Gly Asp Val Asp Arg Thr Gin Phe lie Trp Met 

1045 10S0 10 £ 5 



^ £r Phi tT T IT" ACG CAA TTA *»* CAA TCA AAT ATT AAA 

~ys Tyi Phe Ser lie Phe Asn Thr Gin Leu Asn Gin Ser Asn He Lvs 

7() 1060 1065 10 70 



2496 



2592 



2640 



2688 



2736 



:784 



2B3; 



^0 ill 11 ^ 7 TA AGA TAT AGA GAT WT AAT ATA GAT TTA TCA GGA 

-U He Leu Asn Leu Arg Tyr Arg Asp Asn Asn Leu He Asp Leu Ser Glv 

870 875 680 

Tvr f* 0 w TA GAG GTA TAT GAT GGG GTC AAG "T AAT GAT AAA 

, s Tvr Gly nla Lys Val Glu Val Tyr Asp Gly Val Lys Leu Asn Asp lCs 

885 890 89 | 

£1 G^n Phi 1 TA ^ T AGT TCA GCA GAT AGT AAG ATT AGA GTC ACT 

Asn G.n Phe Lys Leu Thr Ser Ser Ala Asp Ser Lys lie Arg Val Thr 

3() 900 905 910 

CAA AAT CAG AAT ATT ATA TTT AAT AGT ATG TTC CTT GAT TTT AGC GTT 
cln Asn in Asn He He Phe Asn Ser Met Phe Leu Asp III Ser Val 
915 920 925 

Ser Phe Trn tT^ f G ^ *** TAT AGG ™ GAT GAT ATA CAA AAT 

^he Tip He Arg He Pro Lys Tyr Arg Asn Asp Asp He Gin Asn 

935 94Q 

40 Itl u AT t** GAA TAT ACG ATA ATT AAT TGT ATG AAA AAT AAT TCA 2 8BO 

40 l\ r . He His Asn Glu Tyr Thr He He Asn Cvs Met Lys Asn Asn Ser 

950 955 9 60 

GCC TGG AAA ATA TCT ATT AGG GGT AAT AGG ATA ATA TGG ACC TTA ATT 
Gly Trp Lys lie Ser He Arg Glv Asn Arg He lie Trp Thr Leu ill 
965 970 975 

GAT ATA AAT GGA AAA ACC AAA TCA GTA TTT TTT GAA TAT AAC ATA AGA 
Ue Asn 01/ Lys Thr Lys Ser Val Phe Phe Glu Tvr Asn lie Arg 
50 980 98 5 ' 990 

GAA GAT ATA TCA GAG TAT ATA AAT AGA TGG TTT TTT GTA ACT ATT ACT 
Glu Asp He Ser Glu Tyr He Asn Arg Trp Phe Phe Val Thr He ?S 
995 1000 1005 

55 £n £n 1 TG GAT ^ GCT ATT TAT ATT AAT GGC ACG TTA GAA TCA 

Asn Leu Asp Asn Ala Lys He Tyr He Asn Glv Thr Leu Glu Ser 
1010 1015 1020 

60 £n Me TG Ann tTT ^ SI *7* G ? A GAA GT T ATT G .T? AAT GGT GAA ATA 



2929 



!976 



3024 



307: 



3168 



3216 
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25 



60 



65 



Gin tT A l M A T T CAA TCA TAT AGC GAA TAC TTA AAA GAT TTT TGG 

Glu He Tyr Lys He Gin Ser Tyr Ser Glu Tyr Leu Lys Asp Phe Trp 
1075 1080 108 5 

Git n CT 7 TA ATG TAT ** T GAA TAT TAT A ^G TTT AAT GCG GGG 

Y f n ° ^ LyS G1U Tyr T ^ r Met Phe Asn A ^ Glv 

1090 1095 noo 



CTA GAT TTG GTA CTT CAC CAT GAA GAG TGG AGA GTA TAT GCC TAT AAA 
Leu Asp Leu Val Leu His His Glu Glu Trp Arg Val Tyr Ala £yr JJJ 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 

Met Pro Val Thr He Asn Asn Phe Asn Tyr Asn Asp Pro He Asp Asn 

70 6 10 15 



3264 



331; 



10 ^ T TCA TAT ATT *** CTA GTG AAA GAT T CA TCT GTA GGT GAA 3 360 

l« Asn Lys Asn Ser Tyr lie Lys Leu Val Lys Asp Ser Ser Val Gly Glu 6 ° 

1110 1120 

ATA TTA ATA CGT AGC AAA TAT AAT CAG AAT TCC AAT TAT ATA AAT "AT 
He Leu He Arg Ser Lys Tyr Asn Gin Asn Ser As^ lyr lie fttn Tyr 
11-5 i 130 113s 

AGA AAT TTA TAT ATT GGA GAA AAA TTT ATT ATA AGA AGA GAG TCA AAT 
Aig Asn Leu Tyr lie Gly Glu Lys Phe lie lie Arg Arg Glu ler A^n 

2() 0 i 145 H50 

TCT CAA TCT ATA AAT GAT GAT ATA GTT AGA AAA GAA GAT TAT ATA CAT 
Ser Gin Ser He Asn Asp Asp He Val Arg Lys Glu 2J lyl lie SJ 
-l" 1160 1165 



3408 



3456 



3504 



3 5 52 



3648 



T. AT TTT GAA CAG Gft A GAA AAA TTG TTT TTA TCT ATT ATA AC- rar 

•'0 Tyr Phe Lys Glu Gin Glu Glu Lys Leu Phe Lei Sel 111 He Ser £J 
1185 1190 H95 12 5o 

1ST £T GAA JT l AT AAG ACT ATA GAA ATA AAA GAA TAT GAT GAA CAG 
Ser Asn Glu Phe Tyr Lys Thr He Glu He Lys Glu Tvr Asp Glu Gin 
12 °5 1210 ' 1215 

ovo I CA I AT AGT TGT CAG TTG CTT TTT AAA AAA GAT GAA GAA ACT ACT 36 96 

Pro Ser Tyr Ser Cys Gin Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr 

40 1220 l 2 " 1230 

A^D A A n tT A f G ? A T TG A T T 5?T ATT CAT CGT TTC TAC GAA TCT GGA GTT 3 74 4 

Asp Asp lie Gly Leu He Gly He His Arg Phe Tyr Glu Ser Glv Val 
li35 1240 j.245 

4 ' feu Arc £0* f* 0 f** GAT TAT TTT TGT ATA AGT AAA TGG TAC TTA 3792 

12S0 Y yr VS A H 5 Tyr Phe ° yS 116 ^| Q LyS Trp T*/r Leu 

SO *** ^ AG GTA ^ AGG AAA CCA TAT AAG TCA AAT TTG GGA TGT AAT TGG ». n 

%U " '"^ \X S n n Pr ° TV ' r LyS Ser AS " LeU G1 ' ™ A^n Irp 3810 

1270 1 27 5 1280 

CAG TTT ATT CCT AAA GAT GAA GGG TGG ACT GAA TAA 
Gin Phe He Pro Lys Asp Glu Gly Trp Thr Glu 
1285 1290 

(2) INFORMATION FOR SEQ ID NO: 40: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1291 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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Asp Asn lie lie Met Met Glu Pro Pro Phe Ala Arg Gly Thr Gly Arg 
20 25 30 

Tyr Tyr Lys Ala Phe Lys lie Thr Asp Arg He Trp lie lie Pro Glu 
35 40 45 

Arg Tyr Thr Phe Gly Tyr Lys Pro Glu Asp Phe Asn Lys Ser Ser Gly 
50 55 60 

lie Phe Asn Arg Asp Val Cys Glu Tyr Tyr Asp Pro Asp Tyr Leu Asn 
6b 70 75 eo 

Thr Asn Asp Lys Lys Asn He Phe Phe Gin Thr Leu He Lys Leu Phe 
15 8S 90 95 

Asn Arg He Lys Ser Lys Pro Leu Gly Glu Lys Leu Leu Glu Met He 
100 105 no 

He Asn Gly He Pro Tyr Leu Gly Asp Arg Arg Val Pro Leu Glu Glu 
" U 115 120 125 

Phe Asn Thr Asn He Ala Ser Val Thr Val Asn Lys Leu He Ser Asn 
x3 ° 135 140 

2? Pro Gly Glu Val Glu Arg Lys Lys Gly He Phe Ala Asn Leu He He 

1,15 15 ° 155 160 

Phe Gly Pro Gly Pro Val Leu Asn Glu Asn Glu Thr He Asp He Gly 
M) 165 i 70 1?5 

lie Gin Asn His Phe Ala Ser Arg Glu Gly Phe Gly Gly He Met Gin 
180 185 190 

Met Lys Phe Cys Pro Glu Tyr Val Ser Val Phe Asn Asn Val Gin Glu 
i95 200 205 

Asn Lys Gly Ala Ser lie Phe Asn Arg Arg Gly Tyr Phe Ser Asp Pro 
210 215 220 

Ala Leu lie Leu Met His Glu Leu He His Val Leu His Gly Leu Tyr 
" 25 230 235 240 

Gly He Lys Val Asp Asp Leu Pro lie Val Pro Asn Glu Lys Lys Phe 
245 250 255 

Phe Met Gin Ser Thr Asp Thr He Gin Ala Glu Glu Leu Tvr Thr Plv> 
260 265 270 

Gly Gly Gin Asp Pro Ser lie lie Ser Pro Ser Thr Asp Lys Ser He 
275 280 285 

Tyr Asp Lys Val Leu Gin Asn Phe Arg Gly He Val Asp Arg Leu Asn 
290 295 300 

^ Lys Val Leu Val Cys He Ser Asp Pro Asn He Asn He Asn 11* Tyr 

305 310 315 " 320 

Lys Asn Lys Phe Lys Asp Lys Tyr Lys Phe Val Glu Asp Ser Glu Gly 
60 325 230 330 

Lys Tyr Ser He Asp Val Glu Ser Phe Asn Lys Leu Tyr Lys Ser Leu 
340 345 350 

Met Leu Gly Phe Thr Glu He Asn He Ala Glu Asn Tyr Lys He Lys 
°- 355 360 365 

Thr Arg Ala Ser Tyr Phe Set Asp Ser Leu Pro Pro Val Lys He Lvs 
370 375 3 8 o 

70 Asn Leu Leu Asp Asn Glu He Tyr Thr He Glu Glu Gly Phe Asn He 
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40 



45 



385 390 



395 400 



Ser Asp Lys Asn Met Gly Lys Glu Tyr Arg Gly Gin Asn Lys Ala lie 
405 410 415 

Asn Lys Gin Ala Tyr Glu Glu lie Ser Lys Glu His Leu Ala Val Tvr 
420 4,5 43 0 

Lys He Gin Met Cys Lys Ser Val Lys Val Pro Gly He Cys He Asd 
435 440 ' 445 P 

Val Asp Asn Glu Asn Leu Phe Phe lie Ala Asp Lys Asn Ser Phe Ser 
430 455 460 

Asp Asp Leu Ser Lys Asn Glu Arg Val Glu Tyr Asn Thr Gin Asn Asn 
65 470 475 4eo 

Tyr lie Gly Asn Asp Phe Pro He Asn Glu Leu He Leu Asp Thr Asd 
485 490 ^ 495 f 

Leu He Ser Lys He Glu Leu Pro Ser Glu Asn Thr Glu Ser Leu Thr 

Asp Phe Asn Val Asp Val Pro Val Tyr Glu Lys Gin Pro Ala lie Lvs 
515 520 525 

Lys Val Phe Thr Asp Glu Asn Thr He Phe Gin Tyr Lou Tyr Ser Gin 



540 



Thr Phe Pro Leu Asn He Arg Asp He ser Leu Thr Scr Ser Phe Asp 

550 555 56 o 

Asp Ala Leu Leu Val Ser Ser Lys Val Tyr Ser Phe Phe Ser Met Asp 
565 570 575 

Tyr He Lys Thr Ala Asn Lys Val Val Glu Ala Gly Leu Phe Ala Civ 
580 585 590 

Trp Val Lys Gin He Val Asp Asp Phe Val He Glu Ala Asn Lys Ser 
595 eoo 605 

Ser Thr Met Asp Lys He Ala Asp He Ser Leu He Val Pro Tyr lie 
610 615 6 20 

Gly Leu Ala Leu Asn Val Gly Asp Glu Thr Ala Lys Gly Asn Phe Glu 

630 635 6 4o 

Ser Ala Phe Glu lie Ala Gly Ser Ser He Leu Leu Glu Phe lie Pro 
645 650 

Glu Leu Leu lie Pro Val Val Gly Val Phe Leu Leu Glu Ser Tyr He 
660 665 670 

Asp Asn Lys Asn Lys He He Lys Thr He Asp Asn Ala Leu Thr Lys 
b/S 680 6 8 5 

Arg Val Glu Lys Trp He Asp Met Tyr Gly Leu He Val Ala Gin Trp 



" 700 

Leu Ser Thr Val Asn Thr Gin Phe Tyr Thr He Lys Glu Gly Met Tvr 

710 7 15 720 

Lys Ala Leu Asn Tyr Gin Ala Gin Ala Leu Glu Glu He He Lvs Tvr 
725 730 735 ' 

Lys Tyr Asn lie Tyr Ser Glu Glu Glu Lys Ser Asn He Asn He Asn 



740 745 



750 



Phe Asn Asp lie Asn Ser Lys Leu Asn Asp Gly He Asn Gin Ala Met 
s 760 765 
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ASP ^ Ile Asn As P Phe Iie Asn Cys Ser Val Ser Tyr Leu Met 

770 775 780 

Lys Lys Met Ile Pro Leu Ala Val Lys Lys Leu Leu Asp Phe Asp Asn 
785 790 795 H goo 

Thr Leu Lys Lys Asn Leu Leu Asn Tyr Ile Asp Glu Asn Lys Leu Tyr 
805 810 815 y 

Leu lie Gly Ser Val Glu Asp Glu Lys Ser Lys Val Asp Lys Tyr Leu 
820 825 830 

Lys Thr Ile Ile Pro Phe Asp Leu Ser Thr Tyr Ser Asn Ile Glu lie 
|5 835 840 845 

Leu lie Lys lie Phe Asn Lys Tyr Asn Ser Glu Ile Leu Asn Asn Ile 
850 855 860 

■><) r« LSU Ar9 Tyr Ar9 ASP Asn Aan Leu Ile As P Leu ser Gly 

865 870 875 880 

Tyr Gly Ala Lys Val Glu Val Tyr Asp Gly Val Lys Leu Asn Asp Lys 
885 8 90 S9 l 

2* Asn Gin Phe Lys Leu Thr Ser Ser Ala Asp Ser Lys Ile Arg Val Thr 

900 905 910 

Gin Asn Gin Asn Ile lie Phe Asn Ser Met Phe Leu Asp Phe Ser Va] 
30 915 920 925 

Ser Phe Trp lie Arg Ile Pro Lys Tyr Arg Asn Asp Asp lie Gin Asn 
930 935 940 

^ 1*1 Ue HiS ASn G1U Tyr Thr Ile Ile Asn C V S Met A ^ Asn ser 

Jiib 950 955 9 6 o 

Gly Trp Lys lie Ser Ile Arg Gly Asn Arg He lie Trp Thr Leu lie 
965 970 975 

Asp Ile Asn Gly Lys Thr Lys Ser Val Phe Phe Glu Tyr Asn II* Arc 
980 985 990 " 

Glu Asp Ue Ser Glu Tyr lie Asn Arg Trp Phe Phe Val Thr Thr 
995 1000 1005 

Ann Asn Leu Asp Asn Ala Lys Ile Tyr lie Asn Glv Thr Leu Glu Ser 
1010 1015 1020 

Asn Met Asp lie Lys Asp lie Gly Glu Val Ile Val Asn Gly Glu Ile 
• U i02b 1Q 30 1035 10 40 

Thr Phe Lys Leu Asp Gly Asp Val Asp Arg Thr Gin Phe Ile Trp Met 
1045 1050 1055 

Lys Tyr Phe Ser Ile Phe Asn Thr Gin Leu Asn Gin Ser Asn II*. Lys 
1060 1065 1070 

Glu Ile Tyr Lys Ile Gin Ser Tyr Ser Glu Tyr Leu Lys Asp Ph* Trp 
1075 1080 1085 " 

Gly Asn Pro Leu Met Tyr Asn Lys Glu Tyr Tyr Met Phe Asn Ala Glv 
1090 1095 noo 

ftS ^ c LyS ASn Ser Tyr Ue Lys Leu Val Lys As P Ser Ser Val Gly Glu 

°- 1105 IHO 1115 H20 

lie Leu Ile Arg Ser Lys Tyr Asn Gin Asn Ser Asn Tyr lie Asn Tyr 

1130 H35 

"70 Arg Asn Leu Tyr Ile Gly Glu Lys Phe lie Ile Arg Arg Glu Ser Asn 

- 290 - 



40 
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Ser Gin Ser lie Asn Asp Asp lie Val Arg Lys Glu Asp Tyr He His 
HS5 H60 ii 6 5 

Leu Asp Leu Val Leu His His Glu Glu Trp Arg Val Tyr Ala Tyr Lvs 
H70 H75 1180 

Tyr Phe Lys Glu Gin Glu Glu Lys Leu Phe Leu Ser He He Ser Asd 
1185 1190 H95 12 00 

Ser Asn Glu Phe Tyr Lys Thr lie Glu lie Lys Glu Tyr Asp Glu Gin 
1205 1210 12 i5 

Pro Ser Tyr Ser Cys Gin Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr 
1220 1225 1230 

Asp Asp lie Gly Leu He Gly lie His Arg Phe Tyr Glu Ser Gly Val 
1235 1240 12 45 

Leu Arg Lys Lys Tyr Lys Asp Tyr Phe Cys He Ser Lys Trv Tyr Leu 
1-50 1255 1260 

Lys Glu Val Lys Arg Lys Pro Tyr Lys Ser Asn Leu Cly Cvs Asn Trp 
1265 1270 1275 * i 2 5o 

Gin Phe lie Pro Lys Asp Glu Gly Trp Thr Glu 
1285 1290 

(2) INFORMATION FOR SEQ ID NO : 4 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 76 base oairs 
<B) TYPE: nucleic acid* 
EC) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

Cli) MOLECULE TYPE: DNA (genomic) 

Ux) FEATURE: 

(A) NAME /KEY : CDS 
(B> LOCATION: 1 . . 3873 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

mI? d CA w T T £S A A T A AAT TTT TAT ^ T GAT CCT ATT GAT AAT 

Met Pro Val Thr lie Asn Asn Phe Asn Tyr Asn Asp Pro lie Asp Asn 

15 10 is 

£Z tV tT T m TG ATG GAG CCT CCA TTT GCG AGA GGT A ^G GGG AGA 
Asn Asn lie He Met Met Glu Pro Pro Phe Ala Arg Gly Thr Gly Arq 

20 25 30 

TAT TAT AAA GCT TTT AAA ATC ACA GAT CGT ATT TGG ATA ATA CCC PA* 
Tyr Tyr Lys Ala Phe Lys lie Thr Asp Arg ill £o lie lie Pro Glu 
35 40 * 45 

Aro ?*I Thr ph' r? A *** C ° T GAG GAT TTT AAA AGT TCC GGT 

Arg Tyr Ihr Phe Gly Tyr Lys Pro Glu Asp Phe Asn Lys Ser Ser Gly 

50 5 $ 60 

jll IF ^ T ^ GA GAT GTT TGT GAA TAT TAT GAT C <^ GAT TAC TTA AAT 24 0 

lie Phe Asn Arg Asp Val Cys Glu Tyr Tyr Asp Pro Asp Tyr Leu Asn 

b 70 7S qo 



Thr ^ f AT AAG AAT ATA TTT TTA CAA ACA *™ ATC AAG TTA TTT 

Thr Asn Asp Lys Lys Asn lie Phe Leu Gin Thr Met lie Lys Leu Phe 

85 90 95 

AAT AGA ATC AAA TCA AAA CCA TTG GGT GAA AAG TTA TTA GAG ATG ATT 



48 



96 



144 



191 



2B8 



336 



10 



30 



60 
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Asn Arg lie L ys Set Lys Pro Leu Gly Glu Lys Leu Leu Glu Met He 
100 105 110 

ATA AAT GGT ATA CCT TAT CTT GGA GAT AGA CGT GTT CCA CTC GAA GAP 
He Asn Gly He Pro Tyr Leu Gly Asp Arg Arg Val Pro Su SJ 

120 125 

TTT AAC ACA AAC ATT GCT AGT GTA ACT GTT AAT AAA TTA ATr apt a at 
Ph. Asn Thr Asn lie Ala Ser Val Thr Val ££n ^ I™ S 

135 140 

S SJ SI? SK 25 E £ SJ K S5 S JE Z2 S ?2 

150 155 160 



.i.l 235 24 0 

GGC ATT AAA GTA GAT GAT TTA CCA ATT GTA CCA AAT GAA AAA AAA TTT 

Oly He Lys val Asp Asp Leu Pro lie Val Pro Asn Glu iji v /s Vhl 

245 250 255 



Phi m ° TCT ACA GAT GCT ATA CAG °CA GAA GAA CTA TAT ACA TTT 

Phe Met Gin Ser Thr Asp Ala lie Gin Ala Glu Glu Leu Tyr Thr pK 
'60 265 270 

CAA GAT CCC AGC ATC ATA ACT CCT TCT ACG GAT AAA AGT ATr 
4> Gly Gly Gin Asp Pro Ser lie lie Thr Pro Ser Thr Asp tyt SeT lie 

<- 75 280 285 

Tyl A A d £££ SIT T TG , CAA AAT TTT AGA GGG ATA GTT GAT AGA CTT AAC 
S() Vr A ^ L/S Val Leu Gln Asn phe Af g Gly He Val Asp Arg Leu Asn 

295 300 

AAG GTT TTA GTT TGC ATA TCA GAT CCT AAC ATT AAT ATT AAT ATA TAT 
Lys Val Leu Val Cys lie Ser Asp Pro Asn lie Asn 111 £n lie Tyr 
55 310 31 5 320 

AAA AAT AAA TTT AAA GAT AAA TAT AAA TTC GTT GAA GAT TCT GAP m 
Lys Asn Lys Phe Lys Asp Lys Tyr Lys Phe Val Glu ts P Sel Glu Gly 
325 330 r 



335 



AAA TAT AGT ATA GAT GTA GAA AGT TTT GAT AAA TTA TAT AAA AGC TTA 
1-y. Tyr Ser lie Asp Val Glu Ser Phe Asp Lys Leu IyT Id sir 21 
J40 34 5 350 



384 



43: 



480 



III ?M A n CT GGG CCA GTT TTA MT GAA AAT GAG ACT ATA GAT ATA GGT 

Phe Gly Pro Gly Pro Val Leu Asn Glu Asn Glu Thr lie AsJ Ue S3 528 

165 170 1?5 y 

lI A 5 AT V GCA TCA AGG GAA GGC TTC GGG GGT ATA ATG CAA 

He Gin Asn His Phe Ala Ser Arg Glu Gly Phe Gly Gly He £t 5?6 

185 lgo 

ATG AAG TTT TGC CCA GAA TAT GTA AGC GTA TTT AAT AAT GTT PA A raa 
Met Lys Phe Cys Pro Glu Tyr Val Ser Val p£ JJJ J£ SIT G^ G^ 

200 205 

AAC AAA GGC GCA AGT ATA TTT AAT AGA CGT GGA TAT TTT TCA GAT CCA 
A„n Lys oly Ala Ser lie Phe Asn Arg Arg Gly Tyr Phe Ser Asp Pro 

?ff JJ G tT A T TA ATG CAT GAA CTT ATA GAT GTT TTA CAT GGA TTA TAT 
Ala Lou He Leu Met His Glu Leu lie His Val Leu His Gly 2u lyl 



624 



672 



720 



768 



816 



864 



9i; 



960 



1008 



1056 



« K JS SJ JSI S £ £ s s SI £1 S £ % »•« 

5 360 365 

ACT AGA GCT TCT TAT TTT AGT GAT TCC TTA CCA CCA GTA AAA ATA AAA 
70 S Ala S6r TVV Phe Ser As ? S ^ ^u Pro Pro Val Lys lie J£ 1152 

375 380 
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20 



60 



65 



AAT TTA TTA GAT AAT GAA ATC TAT ACT ATA GAG GAA GGG TTT AAT ATA 
Asn Leu Leu Asp Asn Glu lie Tyr Thr lie Glu Glu Gly Phe Asn lie 
385 390 395 400 

TCT GAT AAA GAT ATG GAA AAA GAA TAT AGA GGT CAG AAT AAA GCT ATA 
Ser Asp Lys Asp Met Glu Lys Glu Tyr Arg Gly Gin Asn Lys Ala lie 
405 410 415 



AAG ATA CAA ATG TGT AAA AGT GTT AAA GCT CCA GGA ATA TGT ATT GAT 
)5 Lys lie Gin Mec Cys Lys Ser Val Lys Ala Pro Gly lie Cys lie Asp 



445 



GTT GAT AAT GAA GAT TTG TTC TTT ATA GCT GAT AAA AAT AGT TTT TCA 
Val Asp Asn Glu Asp Leu Phe Phe He Ala Asp Lys Asn Ser Phe Ser 

4 50 455 



460 



TGG GTG AAA CAG ATA GTA AAT GAT TTT GTA ATC GAA GCT AAT AAA AGC 
Trp Val Lys Gin He Val Asn Asp Phe Val lie Glu Ala Asn Lys Ser 
595 600 6 05 

AAT ACT ATG GAT AAA ATT GCA GAT ATA TCT CTA ATT GTT CCT TAT ATA 
Asn Thr Met Asp Lys lie Ala Asp lie Ser Leu He Val Pro Tyr He 
610 615 620 

GGA TTA GCT TTA AAT GTA GGA AAT GAA ACA GCT AAA GGA AAT TTT GAA 
Gly Leu Ala Leu Asn Val Gly Asn Glu Thr Ala Lys Gly Asn Phe Glu 
625 630 635 6 40 

AAT GCT TTT GAG ATT GCA GGA GCC AGT ATT CTA CTA GAA TTT ATA CCA 
Asn Ala Phe Glu lie Ala Gly Ala Ser He Leu Leu Glu Phe He Pro 
645 650 655 



1200 



1248 



AAT AAA CAA GCT TAT GAA GAA ATT AGC AAG GAG CAT TTG GCT GTA TAT 12 96 

10 Asn Lys Gin Ala Tyr Glu Glu He Ser Lys Glu His Leu Ala Val Tyr 

420 425 430 



1344 



1392 



1440 



1488 



GAT GAT TTA TCT AAA AAC GAA AGA ATA GAA TAT AAT ACA CAG AGT AAT 
Asp Asp Leu Ser Lys Asn Glu Arg He Glu Tyr Asn Thr Gin Ser Asn 
465 470 475 4 80 

25 TAT ATA GAA AAT GAC TTC CCT ATA AAT GAA TTA ATT TTA GAT ACT GAT 

Tyr He Glu Asn Asp Phe Pro He Asn Glu Leu He Leu Asp Thr Aso 
485 490 495 

If) rT A *** A ™ GAA TTA CCA AGT GAA ACA GAA TCA CTT ACT 1536 

Leu lie Ser Lys He Glu Leu Pro Ser Glu Asn Thr Glu Ser Leu Thr 
500 505 510 

GAT TTT AAT GTA GAT GTT CCA GTA TAT GAA AAA CAA CCC GCT ATA AAA 
Asp Phe Asn Val Asp Val Pro Val Tyr Glu Lvs Gin Pro Ala He Lys 
-° 515 520 * 525 

AAA ATT TTT ACA GAT GAA AAT ACC ATC TTT CAA TAT TTA TAC TCT CAG 
Lys He Phe Thr Asp Glu Asn Thr He Phe Gin Tyr Leu Tyr S**r Gin 
4() 530 535 540 

ACA TTT CTC TTA GAT ATA AGA GAT ATA AGT TTA ACA TCT TCA TTT GAT 
Thr Phe Leu Leu Asp He Arg Asp lie Ser Leu Thr Ser Ser Phe Asp 
545 550 555 5 6 o 

4 3 GAT GCA TTA TTA TTT TCT AAC AAA GTT TAT TCA TTT TTT TC^ ATG GAT 

Asp Ala Leu Leu Phe Ser Asn Lys Val Tyr Ser Phe Phe Ser Met Asp 
565 570 575 

>l) l AT A T T ACT GCT ^ *** GTG GTA GAA GCA GGA TTA TTT GCA GGT 1776 

><> Tyr He Lys Thr Ala Asn Lys Val Val Glu Ala Gly Leu Phe Ala Gly 

580 585 590 



1584 



1632 



1680 



1728 



1824 



1872 



1920 



1968 



70 T T T A ?T A CCT GT ? GTT GGA GCC TTT TTA TTA GAA T CA TAT ATT 2016 

n) Glu Leu Leu He Pro Val Val Gly Ala Phe Leu Leu Glu Ser Tyr lie 
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66 ° 665 



670 



15 



65 



700 



I CA GTT ACT CAA TTT TAT ACA ATA AAA GAG GGA ATG TAT 

~eu Sec Thr Val Asn Thr Gin Phe Tyr Thr lie Lys Glu Gly Tyr 



895 



91.0 



CAA AAT CAG AAT ATC ATA TTT AAT ACT GTG TTC CTT GAT TTT AGC TTT 
Gxn Asn Gin Asn lie Uo Phe Asn Ser Val Phe Leu Asp Phi Ser vll 

^ 920 925 

qpr Ph T xT A AGA ATA CCT TAT *M AAT GAT GGT ATA CAA AAT 

Ser Phe Trp lie Arg He Pro Lys Tyr Lys Asn Asp Gly lie SS S 

7() 935 940 
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2064 



Asn ^1 f** ^ T *** ATT ATT *** ACA ATA GAT AAT GCT TTA ACT AAA 
Asp Asn Lys Asn Lys lie lie Lys Thr lie Asp Asn Ala Uu J£ J£ 

680 685 

AGA AAT GAA AAA TGG AGT GAT ATG TAG GGA TTA ATA GTA err ran tp, 
Arg Asn Glu Lys Trp Ser Asp Mot: Tyr Gly llu ?S £J ™J ^ 



2160 



710 'IS 720 

AAG GCT TTA AAT TAT CAA GCA CAA GCA TTG GAA GAA ATA ATA AAA TAC 
-vs Aia Leu Asn Tyr Gin Ala Gin Ala Leu Glu Glu lie lie Lys lyr 
725 7 30 735 

20 Arg lyl JE He I" 2?* GAA **° TCA A " AAC ATC GAT 

9 1 ASn Tyr Sei Glu L ^ Glu Lys Ser Asn lie Asn lie Asp 

AAT ? AT A T A *** TCT *** CTT GAG GGT ATT AAC CAA GCT ATA 

Phe Asn Asp lie Asn Ser Lys Leu Asn Glu Gly il e Kn G^ All lie 
3 760 765 

GAT AAT ATA AAT AAT TTT ATA AAT GGA TGT TCT GTA TCA TAT TTA ATC 
«sp Asn lie Asn Ann Phe lie Asn Gly Cys Ser Val Ser Tyl llu 
3() 775 780 

fCt ATG tT T CCA TTA GCT GTA GAA *** TTA CTA GAC TTT GAT AAT 
Lys Met He Pro Ala Val Glu Lys Leu Leu Asp Phe ts P £1 

795 800 

Thr llu AAT J TG TTA AAT TAT ATA GAT GAA AAT AAA TTA TAT 

Tm Leu L/s Lys Asn Leu Leu Asn Tyr Ile Asp Glu Asn Lys Leu Tvr 

805 810 815 

40 ? TG A T T GGA AGT GCA GAA TAT GAA AAA TCA AAA GTA AAT AAA TAC TTC 

40 - lie Gly Ser Ala Glu Tyr Glu Lys Ser Lys Val A^n Si lyr IT 

820 825 a30 

AAA ACC ATT ATG CCG TTT GAT CTT TCA ATA TAT ACC AAT GAT ACA ATA 
4 . Lys Thr lie Met Pro Phe Asp Lou Ser lie Tyr Thr Asn Asp Thr lie 

BJS 840 845 

OTA ATA GAA ATG TTT AAT AAA TAT AAT AGC GAA ATT TTA AAT AAT AT~ 
Leu lie Glu Met Phe Asn Lys Tyr Asn Ser Glu lie lei Ifn HI lie 
5() 855 860 

ATC TTA AAT TTA AGA TAT AAG GAT AAT AAT TTA ATA GAT TTA TCA CCA 
lie Leu Asn Leu Arg Tyr Lys Asp Asn Asn Leu lie Asp lit sS Gly 

0 875 B80 

l A l r? G G ? A f* 6 GTA GAG GTA TAT GAT GGA GTC GAG CTT AAT GAT AAA 2 6 BR 

Tvi Gly Ala Lys Val Glu Val Tyr Asp Gly Val Glu Leu Asn Asp £yt 2688 

885 



2206 



2256 



2304 



2 3 51! 



2400 



2448 



>544 



2592 



2640 



60 r^ ll T TTA ACT AGT TCA GCA MT A ^ AAG ATT AGA GTG ACT 

} ASn ° ln Phe ^ ^ Ser Ala Asn Ser Lys lie Arg Val Thr 

900 



2784 



2832 
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15 



20 



25 



40 



60 



65 



TAT ATT CAT AAT GAA TAT ACA ATA ATT AAT TGT ATG AAA AAT AAT TCG 
Tyr lie His Asn Glu Tyr Thr He lie Asn Cys Met Lys Asn Asn Ser 
945 950 955 9 6 o 

C?v t™ I CT A T T AGG GGT AGG ATA ATA TGG TTA ATT 

Gly Trp Lys lie Ser He Arg Gly Asn Arg lie lie Trp Thr Leu He 

96 5 970 975 



GAA GAT ATA TCA GAG TAT ATA AAT AGA TGG TTT TTT GTA ACT ATT APT 
Glu Asp lie Ser Glu Tyr lie Asn Arg Trp Phe Phi vlt J£ Til THr 
995 1000 10 05 

AAT AAT TTG AAT AAC GCT AAA ATT TAT ATT AAT GGT AAG CTA GAA TCA 
^r, Leu Asn Asn Ala Lys He Tyr lie Asn Gly Lys Leu gTu Ser 
1UAU 1015 1020 

AAT ACA GAT ATT AAA GAT ATA AGA GAA GTT ATT GCT AAT GGT GAA ATA 
Asn Thr Asp lie Lys Asp He Arg Glu Val He Ala Asn Gly Glu" He 
U " 1030 1035 1040 

ATA TTT AAA TTA GAT GGT GAT ATA GAT AGA ACA CAA TTT ATT TGG ATG 
He Phe Lys Leu Asp Gly Asp He Asp Arg Thr Gin Phe He Trp Met 
1045 1050 10 55 



GAA AGA TAT AAA ATT CAA TCA TAT AGC GAA TAT TTA AAA GAT TTT TGG 

35 T^ L> ' S Ue Gln Ser Tyr Ser Glu T ^ Leu Lys Asp Phi £p 

J 1075 1O80 1085 P P 

GGA AAT CCT TTA ATG TAC AAT AAA GAA TAT TAT ATG TTT AAT GCG GGG 

y A n=n L€U MeC Tyr AStl LyS Glu T V r T V r Met Phe Asn Ala Gly 
iu»u 1095 



1100 



CTA GAT TTT TTT AAT TTA AAT CAA GAG TGG AGA GTA TAT ACC TAT AAA 
Leu Asp Phe Phe Asn Leu Asn Gin Glu Trp Arg Val Tyr Thr Tyr 

1170 1175 y " 



1180 



TAT TTT AAG AAA GAG GAA GAA AAA TTG TTT TTA GCT CCT ATA AGT PAT 

Tyr Phe Lys Lys Glu Glu Glu Lys Leu Phe Leu All Pro lie Ser AsJ 
1190 1195 120 G 

TCT GAT GAG TTT TAC AAT ACT ATA CAA ATA AAA GAA TAT GAT GAA CAG 

Ser Asp Glu Phe Tyr Asn Thr lie Gin lie Lys Glu Tyr AsJ G^ Gin 

1205 1210 12 i5 



2880 



2928 



if) a™ rT A ^ T S? A *** ACC TCG GTA TTT TTT GAA TAT AAC ATA AGA o 976 

IU Asp He Asn Gly Lys Thr Lys Ser Val Phe Phe Glu Tyr Asn lie Arq 
980 gas 990 y 



3024 



3072 



3120 



3168 



10 't T Z C AGT ATT TTT ACG GAA TTA AGT CAA TCA AAT ATT GAA 30 16 

M Lys Tyr Phe Ser He Phe Asn Thr Glu Leu Ser Gin Ser Asn lie Glu 

1060 10 65 1070 



3264 



3312 



3360 



3408 



AAT AAA AAT TCA TAT ATT AAA CTA AAG AAA GAT TCA CCT GTA GGT GAA 
1105 ^io^ 2 ^ ^ ii? 5 Ser Pr ° Val Gly GlU 

4? ATT TTA ACA CGT AGC AAA TAT AAT CAA AAT TCT AAA TAT ATA AAT TAT 

He Leu Thr Arg Ser Lys Tyr Asn Gin Asn Ser Lvs Tvr He Asn Tyr 
H25 U30 * 1135 

AGA GAT TTA TAT ATT GGA GAA AAA TTT ATT ATA AGA AGA AAG TCA AAT 3 4 56 

AL " 9 As P Leu TyV le G1 V Lys Phe He lie Arq Arg Lys Ser £JJ 

1140 H45 * 1150 

TCT CAA TCT ATA AAT GAT GAT ATA GTT AGA AAA GAA GAT TAT ATA TAT 
s . Ser Gin Ser lie Asn Asp Asp lie Val Arg Lys Glu Asp Tyr lie Tyr 

1155 1160 1165 



3504 



3552 



3600 



3648 
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3696 



3744 



3792 



3840 



CCA ACA TAT ACT TGT CAG TTG CTT TTT AAA AAA GAT GAA GAA ACT ACT 
Pro Thr Tyr Ser Cys Gin Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr 
1220 1225 1230 

GAT GAG ATA GGA TTG ATT GGT ATT CAT CGT TTC TAC GAA TCT CPA ltt 
Asp Glu lie Gly Leu lie Gly He His Arg lie T^r G^ Ser Gly ill 
1235 1240 1245 

vlt III TAT *** GAT TAT TTT TGT ATA AGT AAA TGG TAC TTA 

Val Phe Glu Glu Tyr Lys Asp Tyr Phe Cys lie Ser Lys Trp Tyr Leu 

1255 X260 

AAA GAG GTA AAA AGG AAA CCA TAT AAT TTA AAA TTG GGA TGT AAT TGG 
Lys Glu Val Lys Arg Lys Pro Tyr Asn Leu Lys Leu Gly lyl J£ Trp 

1270 1275 12 BQ 

CAG TTT ATT CCT AAA GAT GAA GGG TGG ACT GAA TAA , a7fi 
Gin Phe lie Pro Lys Asp Glu Gly Trp Thr Glu 6 
12 85 1290 

(2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1291 amino acids 
IB) TYPE: amino acid 
(D) TOPOLOGY: linear 

t'ii) MOLECULE TYPE: protein 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

Met Pro Val Thr lie Asn Asn Phe Asn Tyr Asn Asp Pro lie Asp Asn 
1 5 io 15 

Asn Asn He He Met Met Glu Pro Pro Phe Ala Arg Gly Thr Gly Arq 
20 25 30 

Tyr Tyr Lys Ala Phe Lys He Thr Asp Arg lie Trp lie lie Pro Glu 
i5 40 45 

Arg Tyr Thr Phe Gly Tyr Lys Pro Glu Asp Phe Asn Lys Ser Ser Gly 

lie Phe Asn Arg Asp Val Cys Glu Tyr Tyr Asp Pro Asp Tyr Leu Asn 

70 15 80 

Thr Asn Asp Lys Lys Asn lie Phe Leu Gin Thr Met lie Lys Leu Phe 
85 90 95 

Asn Arg He Lys Ser Lys Pro Leu Glv Glu Lys Leu Leu Glu Met. He 
100 105 no 

He Asn Gly lie Pro Tyr Leu Gly Asp Arg Arg Val Pro Leu Glu Glu 
115 120 125 

Phe Asn Thr Asn He Ala Ser Val Thr Val Asn Lys Leu lie Ser Asn 
0 l 3 ^ 140 

Pro Gly Glu Val Glu Arg Lys Lys Gly He Phe Ala Asn Leu He lie 
M5 150 155 iso 
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10 



25 



Phe GXy Pro Gly Pro Val Leu Asn Glu Asn Glu Thr lie Asp lie Gly 
165 170 175 

lie Gin Asn His Phe Ala Ser Arg Glu Gly Phe Gly Gly lie Met Gin 
180 las 19Q 

Met Lys Phe Cys Pro Glu Tyr Val Ser Val Phe Asn Asn Val Gin Glu 
195 200 205 

Asn Lys Gly Ala Ser lie Phe Asn Arg Arg Gly Tyr Phe Ser Asp Pro 
*- 10 215 220 

Ala Leu lie Leu Met His Glu Leu lie His Val Leu His Gly Leu Tvr 

230 235 2 4o 

Gly lie Lys Val Asp Asp Leu Pro lie Val Pro Asn Glu Lys Lys Phe 
245 250 255 

Phe Met Gin Ser Thr Asp Ala lie Gin Ala Glu Glu Leu Tyr Thr Phe 
260 265 270 

Gly Gly Gin Asp Pro Ser lie lie Thr Pro Ser Thr Asp Lys Ser lie 
-75 280 285 

Tyr Asp Lys Val Leu Gin Asn Phe Arg Gly lie Val Asp Arg Leu Asn 
" 90 295 300 

Lys Val Leu Val Cys He Ser Asp Pro Asn lie Asn lie Asn lie Tyr 
305 Jin 1 

30 315 3 20 

Lys Asn Lys Phe Lys Asp Lys Tyr Lys Phe Val Glu Asp Ser Glu Glv 
325 33 0 335 

Lys Tyr ser lie Asp Val Glu Ser Phe Asp Lys Leu Tvr Lvs Ser Leu 
340 345 - 35Q 

Met Phe Gly Phe Thr Glu Thr Asn lie Ala Glu Asn Tyr Lys lie Lys 
355 360 365 Y 

40 Thr Arg Ala Ser Tyr Phe Ser Asp Ser Leu Pro Pro Val Lys lie Lvs 

370 375 3 8 o 

Asn Leu Leu Asp Asn Glu lie Tyr Thr lie Glu Glu Gly Phe Asn He 

45 390 395 400 

Ser Asp Lys Asp Met Glu Lys Glu Tyr Arg Glv Gin Asn Lvs Ala lie 
405 410 • ■ 415 

Asn Lys Gin Ala Tyr Glu Glu lie Ser Lys Glu His Leu Ala Val Tyr 
420 405 430 

Lys lie Gin Met Cys Lys Ser Val Lys Ala Pro Glv lie Cys He Asp 
43 5 440 445 

Val Asp Asn Glu Asp Leu Phe Phe lie Ala Asp Lys Asn Ser Phe Ser 
4=>0 455 460 

Asp Asp Leu Ser Lys Asn Glu Arg He Glu Tyr Asn Thr Gin Ser Asn 

60 470 475 480 

Tyr lie Glu Asn Asp Phe Pro lie Asn Glu Leu lie Leu Asp Thr Asp 
485 4 9o 4g5 



50 



5.1 



65 



70 



Leu lie Ser Lys lie Glu Leu Pro Ser Glu Asn Thr Glu Ser Leu Thr 
500 505 510 

Asp Phe Asn Val Asp Val Pro Val Tyr Glu Lys Gin Pro Ala lie Lvs 
515 5 20 525 

Lys He Phe Thr Asp Glu Asn Thr He Phe Gin Tyr Leu Tyr Ser Gin 
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10 



1.5 



40 



45 



60 



530 535 540 

Thr Phe Leu Leu Asp lie Arg Asp He Ser Leu Thr Ser Ser Phe Asd 

550 555 56 b 

Asp. Ala Leu Leu Phe Ser Asn Lys Val Tyr Ser Phe Phe Ser Met Asp 

565 570 5?5 

Tyr He Lys Thr Ala Asn Lys Val Val Glu Ala Gly Leu Phe Ala Gly 

580 585 ' 



590 



Trp Val Lys Gin lie Val Asn Asp Phe Val He Glu Ala Asn Lvs Ser 

595 600 



605 



Asn Thr Met Asp Lys He Ala Asp He Ser Leu He Val Pro Tyr He 
610 615 620 7 

Gly Leu Ala Leu Asn Val Gly Asn Glu Thr Ala Lys Gly Asn Phe Glu 

630 635 640 

Asn Ala Phe Glu lie Ala Gly Ala Ser He Leu Leu Glu Phe He Pro 
64S 650 655 

Glu Lou Leu He Pro Val Val Gly Ala Phe Leu Leu Glu Ser Tyr He 
660 665 6 7o 

Asp Asn l.ys Asn Lys He He Lys Thr He Asp Asn Ala Leu Thr Lys 
675 680 685 

Arg Asn Glu Lys Trp Ser Asp Met Tyr Gly Leu He Val Ala Gin Trp 

695 700 K 

Leu Ser Thr Val Asn Thr Gin Phe Tyr Thr He Lvs Glu Gly Met Tyr 

710 7 15 720 

Lys A Ja Leu Asn Tyr Gin Ala Gin Ala Leu Glu Glu He He Lv. T-r 
? 25 730 7 j 5 ' 

Arg Tyr Asn lie Tyr Ser Glu Lys Glu Lys Ser Asn He Asn lie Asd 
740 -"15 750 

Phe Asn Asp He Asn Ser Lys Leu Asn Glu Uly He Asn Gin Ala He 
7j5 "'eo 765 

Asp Asn He Asn Asn Phe He Asn Gly Cys Ser Val Ser Tyr Leu Met 
u 7 75 7 80 

Lys Lys Met He Pro Leu Ala Val Glu Lys Leu Leu Asp Phe Asp Asn 

50 90 795 600 

Thr Leu Lys Lys Asn Leu Leu Asn Tyr He Asp Glu Asn Lys Leu Tyr 

805 810 8i5 

Leu He Gly Ser Ala Glu Tyr Glu Lys Ser Lys Val Asn Lys Tyr Leu 
820 825 830 

Lys Thr lie Met Pro Phe Asp Leu Ser He Tyr Thr Asn Asp Thr lie 
335 840 8 4 5 

Leu He Glu Met Phe Asn Lys Tyr Asn Ser Glu He Leu Asn Asn He 
850 855 860 

He Leu Asn Leu Arg Tyr Lys Asp Asn Asn Leu He Asp Leu Ser Glv 

65 870 875 • 880 

Tyr Gly Ala Lys Val Glu Val Tyr Asp Gly Val Glu Leu Asn Asp Lvs 
885 890 " 



895 



70 



Asn Gin Phe Lys Leu Thr Ser Ser Ala Asn Ser Lys He Arg Val Thr 
900 905 910 
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Gin Asn Gin Asn lie lie Phe Asn Ser val Phe Leu Asp Phe Ser Val 
y15 920 925 

Ser Phe Trp He Arg lie Pro Lys Tyr Lys Asn Asp Gly lie Gin Asn 

935 94 0 

Tyr lie His Asn Glu Tyr Thr He lie Asn Cys Met Lys Asn Asn Ser 
9S0 955 g60 

Gly Trp Lys He Ser lie Arg Gly Asn Arg lie lie Trp Thr Leu He 
965 970 975 

Asp He Asn Gly Lys Thr Lys Ser Val Phe Phe Glu Tyr Asn fie Arg 
9fl0 985 990 y 

Glu Asp lie ser Glu Tyr He Asn Arg Trp Phe Phe Val Thr He Thr 

1000 1005 

Asn Asn Leu Asn Asn Ala Lys He Tyr He Asn Gly Lys Leu Glu Ser 

101 5 1020 

Asnjhr Asp He Lys AspUe Arg Glu Val U, Wa Asn Gly Glu He 

He Phe Lys Leu Asp Gly Asp He Asp Arg Thr Gin Phe lie Trp Met 
1045 1050 10 g 5 

Lys Tyr Phe Ser lie Phe Asn Thr Glu Leu Ser Gin Ser Asn He Glu 
1060 1065 i070 

Glu Arg Tyr Lys He Gin Ser Tyr Ser Glu Tyr Leu Lys Asp Pho Trp 

■ LU '=' 1080 



1085 



Gly Asn Pro Leu Met Tyr Asn Lys Glu Tyr Tyr Met Phe Asn Ala Gly 

1095 1100 

Asn^Lys Asn Ser Tyr lie Lys Leu Lys Lys Asp Ser Pro Val Gly Glu 

1115 1120 

He Lou Thr Arg Ser Lys Tyr Asn Gin Asn Ser Lys . Tyr He Asn Tyr 

1125 1^0 1135 

Arg Asp Leu Tyr lie Gly Glu Lys Phe He He Arg Arg Lys Sor Asn 

1140 H45 1150 

Ser Gin Senile Asn Asp Asp H^Val Arg Lys Glu Jsp Tyr lie Tyr 

Lou Asp Phe Phe Asn Leu Asn Gin Glu Trp Arg Val Tyr Thr Tyr Lys 

1175 1180 

Tyr Phe Lys Lys Glu Glu Glu Lys Leu Phe Leu Ala Pro He Ser Asp 

1190 1195 1200 

Ser Asp Glu Phe Tyr Asn Thr lie Gin He Lys Glu Tyr Asp Glu Gin 

05 121° 1215 

Pro Thr Tyr Ser Cys Gin Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr 

1220 1225 123 0 

Asp Glu IieGly Leu He Gly lie His Arg Phe Tyr Glu Ser Gly lie 



1245 



Val Phe Glu Glu Tyr Lys Asp Tyr Phe Cys il e ser Lys Trp Tyr Leu 

1255 1260 
Ly^Glu Val Lys Arg Lys Pro Tyr Asn Leu Lys Leu Gly Cys Asn Trp 

1275 1280 
Gin Phe Ile Pro Lys Asp Glu Gly Trp Thr Glu 
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1285 1290 
(2) INFORMATION FOR SEQ ID NO: 43: 

ti) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1526 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = " DNA " 

iix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 108 1523 



(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

20 AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 6 0 

TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT n 6 

Met Gly His 

25 1 

CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT 164 
Hls Hls Hl s His His His His His Ser Ser Giv His lie Glu Glv 
5 10 15 

CGT CAT ATG GCT AGC ATG GCT GAT ACA ATA CTA ATA GAA ATG TTT AAT 212 
Arg His Met Ala Ser Met Ala Asp Thr He Leu He Glu Met Phe Asn 
20 25 30 35 

AAA TAT AAT AGC GAA ATT TTA AAT AAT ATT ATC TTA AAT TTA AGA TAT 260 
Lys Tyr Asn Ser Glu lie Leu Asn Asn He He Leu Asn Leu Arg Tyr 
40 45 50 

AGA GAT AAT AAT TTA ATA GAT TTA TCA GGA TAT GGA GCA AAG GTA GAG 308 
Arg Asp Asn Asn Leu He Asp Leu Ser Gly Tyr Gly Ala Lys Val Glu 
55 60 65 

GTA TAT GAT GGG GTC AAG CTT AAT GAT AAA AAT CAA TTT AAA TTA ACT 3 56 

Val Tyr Asp Gly Val Lys Leu Asn Asp Lys Asn Gin Phe Lys Leu Thr 
7 0 75 80 

AGT TCA GCA GAT ACT AAG ATT AGA GTC ACT CAA AAT CAG AAT ATT ATA 4 04 

Ser Ser Ala Asp Ser Lys He Arg Val Thr Gin Asn Gin Asn He He 
85 90 95 

TTT AAT AGT ATG TTC CTT GAT TTT AGC GTT AGC TTT TGG ATA AGG ATA 4 5? 

Phe Asn Ser Met Phe Leu Asp Phe Ser Val Ser Phe Trp He Arq He 
10 ° 105 no ' us 

CCT AAA TAT AGG AAT GAT GAT ATA CAA AAT TAT ATT CAT AAT GAA TAT 5 00 

Pro Lys Tyr Arg Asn Asp Asp He Gin Asn Tyr He His Asn Glu Tyr 
120 125 130 

ACG ATA ATT AAT TGT ATG AAA AAT AAT TCA GGC TGG AAA ATA TCT AT^ 54 8 

Thr He He Asn Cys Met Lys Asn Asn Ser Gly Trp Lys He Ser lie 
60 135 140 145 

AGG GGT AAT AGG ATA ATA TGG ACC TTA ATT GAT ATA AAT GGA AAA ACC 596 
Arg Gly Asn Arg He He Trp Thr Leu He Asp He Asn Gly Lys Thr 
150 155 160 

AAA TCA GTA TTT TTT GAA TAT AAC ATA AGA GAA GAT ATA TCA GAG TAT 64 4 

Lys Ser Val Phe Phe Glu Tyr Asn He Arg Glu Asp He Ser Glu Tyr 
165 170 175 

ATA AAT AGA TGG TTT TTT GTA ACT ATT ACT AAT AAT TTG GAT AAT GCT 6 92 



40 



45 



50 



65 



70 
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lie Asn Arg Trp Phe Phe Val Thr lie Thr Asn Asn Leu Asp Asn Ala 
185 . 190 195 

AAA ATT TAT ATT AAT GGC ACG TTA GAA TCA AAT ATG GAT ATT AAA GAT 
Lys He Tyr He Asn Gly Thr Leu Glu Ser Asn Met Asp lie Lys Asp 
200 205 210 

lit r?* GT 7 T GTT ^ GGT GAA ATA ACA AAA TTA GAT GGT 

]() He Gly Glu Val lie Val Asn Gly Glu He Thr Phe Lys Leu Asp Gly 

220 225 

GAT GTA GAT AGA ACA CAA TTT ATT TGG ATG AAA TAT TTT AGT A^T TTT 
Asp Val Asp Arg Thr Gin Phe lie Trp Met Lvs Tvr Phe Ser lie Phe 
15 230 23 5 240 

AAT TTA CAA TCA AAT ATT AAA GAG ATA TAT AAA ATT CAA 

Asn Thr Gin Leu Asn Gin Ser Asn He Lys Glu He Tyr Lyt He eft 

^•±z> 250 



20 



.10 



40 



50 



60 



255 



TCA TAT AGC GAA TAC TTA AAA GAT TTT TGG GGA AAT CCT TTA ATG TAT 
Ser Tvr Ser Glu Tyr Leu Lys Asp Phe Trp Gly Asn Pro Leu Met Tyr 
ZbU 265 270 2 7S 

^1 GAA l AT TAT ATG TTT AA"^ GCG GGG AAT AAA AAT TCA TAT ATT 

Hsn Lys Glu Tyr Tvr Met Phe Asn Ala Gly Asn Lvs Asn Ser Tyr ill 

280 sss • 2 ; 0 

AAA CTA GTG AAA GAT TCA TCT GTA GGT GAA ATA TTA ATA CGT AGC AAA 
Lys Lou val Lys Asp Ser Ser Val Gly Glu He leu He Arg Ser Lvs 
' 95 300 3 05 

TAT AAT CAG AAT TCC AAT TAT ATA AAT TAT AGA AAT TTA TAT ATT CCA 
Tyr Asn Gin Asn Ser Asn Tyr lie Asn Tyr Arg £n Leu Tyr lie GiO 

315 320 

GAA AAA TTT ATT ATA AGA AGA GAG TCA AAT TCT CAA TCT ATA AAT GAT 
Glu Lys Phe lie He Arg Arg Glu Ser Asn Ser G ft Icr lie Tsn Asp 
J * s 330 335 

GAT ATA GTT AGA AAA GAA GAT TAT ATA CAT CTA GAT TTG GTA CTT CAC 
Asp He Val Arg Lys Glu Asp Tyr He His Leu Asp Leu Val Leu His 
° 345 350 355 

4> Stl 2** TGG AGA GTA TAT GCC TAT AAA TAT TTT AAG GAA CAG GAA 

■»> His Go, Glu Trp Arg Val Tyr Ala Tyr Lys Tyr Phe Lys Glu Gin GlC 



360 36 



J70 



GAA AAA TTG TTT TTA TCT ATT ATA AGT GAT TCT AAT GAA TTT TAT AAG 
Glu Lys Leu Phe Leu Ser He He Ser Asp Ser Asn Glu Phi Tyr I£ 
375 360 385 y 

Thr He eft iTe ^ G f l AT f T GAA CAG CCA TCA T AT AGT TGT CAG 
Thr He Glu He Lys Glu Tyr Asp Glu Gin Pro Ser Tyr Ser C V s Gin 

390 395 4 oo 

11?, f I 1 " AAA AAA GAT GAA GAA AGT ACT GAT GAT ATA GGA TTG ATT 

Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr Asp Asp He Gly Leu lil 

410 415 

GGT ATT CAT CGT TTC TAC GAA TCT GGA GTT TTA CGT AAA AAG TAT AAA 
Gly lie His Arg Phe Tyr Glu Ser Gly Val Leu Arg Lys Lys Tyr I£ 

425 430 435 



445 



450 



70 



CCA TAT AAG TCA AAT TTG GGA TGT AAT TGG CAG TTT ATT CCT AAA GAT 
Pro Tyr Lys Ser Asn Leu Gly Cys Asn Trp Gin Phe Lie So JJJ £J 
455 460 465 



740 



788 



836 



864 



932 



980 



1028 



1076 



1124 



1172 



1220 



1268 



1316 



1364 



1412 



« til £ SI 51 12 E £ VS £ E £ SS S2 E i£ E 



1508 
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GAA GGG TGG ACT GAA TAA 

Glu Gly Trp Thr Glu 
470 

(2) INFORMATION FOR SEQ ID NO: 44: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 472 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY : linear 

<ii) MOLECULE TYPE : protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

Met Gly His His His His His His His His His His Ser Ser Glv His 
1 5 io X 5 

He Glu Gly Arg His Met Ala Ser Met Ala Asp Thr He Leu He Glu 
20 25 30 

Met Phe Asn Lys Tyr Asn Ser Glu He Leu Asn Asn lie lie Leu Asn 
35 40 45 

Leu Arg Tyr Arg Asp Asn Asn Leu He Asp Leu Ser Gly Tvr Glv Ala 
50 55 60 

Lys Val Glu Val Tyr Asp Gly Val Lys Leu Asn Asp Lys Asn Gin Phe 
65 7 ° 75 80 

Lys Leu Thr Ser Ser Ala Asp Ser Lys lie Arg Val Thr Gin Asn Gin 
85 90 95 

Asn lie lie Phe Asn Ser Met Phe Leu Asp Phe Ser Val Ser Phe Trp 
100 105 no 

He Arg lie Pro Lyr, Tyr Arg Asn Asp Asp He Gin Asn Tyr He His 
115 i20 125 

Asn Glu Tyr Thr He He Asn Cys Met Lys Asn Asn Ser Gly Trp Lvs 
i30 135 140 

lie Ser lie Arg Gly Asn Arg He He Trp Thr Leu He Asp He Asn 
145 ISO 155 160 

Gly Lys Thr Lys Ser Val Phe Phe Glu Tyr Asn lie Arg Glu Asp He 
165 170 175 

Ser Glu Tyr He Asn Arg Trp Phe Phe Val Thr He Thr Asn Asn Leu 
180 185 190 

Asp Asn Ala Lys He Tyr He Asn Gly Thr Leu Glu Ser Asn Met Asp 
195 200 205 

He Lys Asp He Gly Glu Val He Val Asn Gly Glu He Thr Phe Lvs 
210 215 220 

Leu Asp Gly Asp Val Asp Arg Thr Gin Phe He Trp Met Lvs Tvr Phe 
225 230 235 * * 240 

Ser He Phe Asn Thr Gin Leu Asn Gin Ser Asn He Lvs Glu He Tyr 
245 250 255 

Lys He Gin Ser Tyr Ser Glu Tyr Leu Lys Asp Phe Trp Gly Asn Pro 
260 265 270 

Leu Met Tyr Asn Lys Glu Tyr Tyr Met Phe Asn Ala Glv Asn Lys Asn 
275 280 285 

Ser Tyr He Lys Leu Val Lys Asp Ser Ser Val Gly Glu lie Leu lie 
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290 295 



300 



10 



15 



40 



45 



50 



Arg Ser Lys Tyr Asn Gin Asn Ser Asn Tyr lie Asn Tyr Arg Asn Leu 

310 315 320 

Tyr lie Gly Glu Lys Phe lie He Arg Arg Glu Ser Asn Ser Gin Ser 
32 5 330 335 

lie Asn Asp Asp lie Val Arg Lys Glu Asp Tyr He His Leu Asp Leu 
340 345 35 o F 

Val Leu His His Glu Glu Trp Arg Val Tyr Ala Tyr Lys Tyr Phe Lys 



365 



Glu Gin Glu Glu Lys Leu Phe Leu Ser lie lie Ser Asp Ser Asn Glu 
J U 375 380 

Phe Tyr Lys Thr He Glu lie Lys Glu Tyr Asp Glu Gin Pro Ser Tyr 
b 390 395 400 

Ser Cys Gin Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr Asp Asp He 
405 410 41 5 

Gly Leu lie Gly lie His Arg Phe Tyr Glu Ser Gly Val Leu Arg Lvs 
420 425 430 " 

Lys Tyr Lys Asp Tyr Phe Cys He Ser Lys Trp Tyr Leu Lvs Glu Val 
43 5 440 445 ' 

Lys Arg Lys Pro Tyr Lys Ser Asn Leu Gly Cys Asn Trp Gin Phe lie 
450 455 460 

Pro Lys Asp Glu Gly Trp Thr Glu 
465 47 0 

(2) INFORMATION FOR SEQ ID NO : 4 5 : 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1547 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE ; DNA (genomic) 

fix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 108.. 152 3 

Ui) SEQUENCE DESCRIPTION : SEQ ID NO: 45: 
AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 



60 

„. TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT n 6 

Met Gly His 
1 



H?I ful S AT 5 AT CAT CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT 

6(J His His His His His His His His His Ser Ser Gly His lie Glu Gly 

J 10 15 

CGT CAT ATG GCT AGC ATG GCT GAT ACA ATA CTA ATA GAA ATG TTT AAT 
Arg His Met Ala Ser Met Ala Asp Thr He Leu He GlC Met Phe U5 
65 25 30 35 

AAA TAT AAT AGC GAA ATT TTA AAT AAT ATT ATC TTA AAT TTA AGA TAT 

Lys Tyr Asn Ser Glu lie Leu Asn Asn He lie Leu Asn Leu Arg lyr 

40 4 5 50 

70 AAG GAT AAT AAT TTA ATA GAT TTA TCA GGA TAT GGG GCA AAG GTA GAG 



164 



211 



260 



308 
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Lys Asp Asn Asn Leu lie Asp Leu Ser Gly Tyr Gly Ala Lys Val Glu 
55 60 6 5 

CTA TAT GAT GGA GTC GAG CTT AAT GAT AAA AAT CAA TTT AAA TTA ACT 
Val Tyr Asp Gly Val Glu Leu Asn Asp Lys Asn Gin Phe Lys Leu" TnT 

ACT TCA GCA AAT ACT AAG ATT AGA GTG ACT CAA AAT CAG AAT ATC ATA 
1() Ser Ser Ala Asn Ser Lys lie Arg Val Thr Gin Asn Gin Asn lie lie 

90 95 

TTT AAT ACT GTG TTC CTT GAT TTT AGC GTT AGC TTT TGG ATA AGA ATA 
Phe Asn Ser Val Phe Leu Asp Phe Ser Val Ser Phe Trp lie £a il s 
15 105 110 115 

l C Z ?** l** ^ G AAT GAT GGT ATA CAA AAT ™T ATT CAT AAT GAA TAT 
Pro Lys Tyr Lys Asn Asp Gly He Gin Asn Tyr He His Asn Gli T^r 

120 



40 



50 



60 



130 



Thr tT A tT T ^ T l GT ATG *** ** T ^ TCG GGC TGG AAA ATA TCT ATT 
Thr He He Asn Cys Met Lys Asn Asn Ser Gly Trp Lys He Ser lie 

135 140 145 

™ GGT ™ T AGG ATA ATA TGG ACT TTA ATT GAT ATA AAT GGA AAA ACC 
Aig Gly Asn Arg lie lie Trp Thr Leu He Asp He Asn Gly JCS Thr 
ij0 1" 160 

AAA TCC1 GTA TTT TTT GAA TAT AAC ATA AGA GAA GAT ATA TCA GAG TAT 
Lys ser v.n Phe Phe Glu Tyr Asn He Arg Glu Asp He ler Glu Tyr 
lb - > 170 175 

ATA AAT AGA TGG TTT TTT GTA ACT ATT ACT AAT AAT TTG AAT AAC OCT 
lie asn Arg Trp Phe Phe Val Thr He Thr Asn Asn Leu Asn Asn All 

185 190 195 

AAA ATT TAT ATT AAT GGT AAG CTA GAA TCA AAT ACA GAT ATT AAA GAT 
Lys He Tyr lie Asn Gly Lys Leu Glu Ser Asn Thr Asp He Lvs Asp 
200 205 2 io 

ATA AGA GAA GTT ATT GCT AAT GGT GAA ATA ATA TTT AAA TTA GAT GGT 
Ue Arg Glu Val He Ala Asn Gly Glu He He Phe Lys Leu AsJ Gly 
215 220 225 

4S A A n tT A GAT AGA ^ A CAA TTT ATT TGG ATG AAA TAT TTC ACT ATT TTT 

-»■ Asp lie Asp Arg Thr Gin Phe lie Trp Met Lys Tvr Phe Ser He Phe 

220 235 



240 



AA n ?H G 7 TA AGT CAA TCA AAT ATT GAA GAA AGA TAT AAA ATT CAA 

Asn Thr Glu Leu Ser Gin Ser Asn He Glu Glu Arg Tyr Lys lie Gin 

<J4b 250 



255 



1^ l AT e G ° ^ ™ T TTA *** GAT TTT TGG GGA CCT TTA ATG TAG 

Ser Tyr Ser Glu Tyr Leu Lys Asp Phe Trp Gly Asn Pro Leu Met Tyr 

55 * 270 275 

AAT AAA GAA TAT TAT ATG TTT AAT GCG GGG AAT AAA AAT TCA TAT ATT 

Asn Lys Glu Tyr Tyr Met Phe Asn Ala Gly Asn Lys Asn Ser Tyr lie 

280 285 290 



3 56 



404 



4S2 



500 



54B 



596 



644 



692 



740 



788 



836 



684 



93: 



980 



lIm ?** ^ T T A CCT GTA GGT GAA ATT TTA ACA CGT AGC AAA 1028 

Lys Leu Lys Lys Asp Ser Pro Val Gly Glu He Leu Thr Arg Ser Lys 
295 300 305 



A> Z kT AAT CAA ^ TCT TAT ATA AAT TAT AGA GAT TTA TAT ATT CGA in -7* 

fo Tyr Asn Gin Asn Ser Lys Tyr lie Asn Tyr Arg Asp Leu ™ 111 Gly ™ * 

^10 315 320 
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10 



20 



30 



40 



50 



60 



GAA AAA TTT ATT ATA AGA AGA AAG TCA AAT TCT CAA TCT ATA AAT GAT H24 
Glu Lys Phe lie lie Arg Arg Lys Ser Asn Ser Gin Ser lie Asn Asp 
325 330 335 



(i) SEQUENCE CHARACTERISTICS: 
d - <A) LENGTH: 472 amino acids 

H? (B) TYPE: amino acid 

(D) TOPOLOGY: linear 



!ii) MOLECULE TYPE : protein 
Ui) SEQUENCE DESCRIPTION: SEQ ID NO:46: 
Met Gly His His His His His His His His His His Ser Ser Gly His 

lie Glu Gly Arg His Met Ala Ser Met Ala Asp Thr lie Leu He Glu 
20 25 30 

Met Phe Asn Lys Tyr Asn Ser Glu lie Leu Asn Asn lie lie Leu Asn 
35 4 o 45 

Leu Arg Tyr Lys Asp Asn Asn Leu He Asp Leu Ser Gly Tvr Gly Ala 
50 55 go 

6S L n Val ° 1U Val TYr ASP Gly Val Glu Leu Asn As P ^ s Asn Gin Phe 

^ 70 75 80 

Lys Leu Thr Ser Ser Ala Asn Ser Lys lie Arg Val Thr Gin Asn Gin 
Q 5 90 95 

70 Asn lie lie Phe Asn Ser Val Phe Leu Asp Phe Ser Val Ser Phe Trp 

- 305 - 



1172 



GAT ATA GTT AGA AAA GAA GAT TAT ATA TAT CTA GAT TTT TTT AAT TTA 
Asp He Val Arg Lys Glu Asp Tyr He Tyr Leu Asp Phe Phe Asn Leu 
340 345 350 355 

rf 3 l GG AGA GTA TAT ACC TAT TAT TTT AAG AAA GAG GAA 12 2 0 

Asn Gin Glu Trp Arg Val Tyr Thr Tyr Lys Tyr Phe Lys Lys Glu Glu 

360 365 37 0 

GAA AAA TTG TTT TTA GCT CCT ATA AGT GAT TCT GAT GAG TTT TAC AAT 
Glu Lys Leu Phe Leu Ala Pro lie Ser Asp Ser Asp Glu Phe Tyr Asn 
375 380 385 

ACT ATA CAA ATA AAA GAA TAT GAT GAA CAG CCA ACA TAT AGT TGT CAG 
Thr He Gin He Lys Glu Tyr Asp Glu Gin Pro Thr Tyr Ser Cvs Gin 
390 395 400 

TTG CTT TTT AAA AAA GAT GAA GAA AGT ACT GAT GAG ATA GGA TTG ATT 
Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr Asp Glu He Gly Leu He 

410 415 

GGT ATT CAT CGT TTC TAC GAA TCT GGA ATT GTA TTT GAA GAG TAT AAA 
Gly lie His Arg Phe Tyr Glu Ser Gly He Val Phe Glu Glu Tyr Lys 
6 ° 426 -430 435 



1268 



1316 



1364 



1412 



1508 



GAT TAT TTT TGT ATA AGT AAA TGG TAC TTA AAA GAG GTA AAA AGG AAA 1460 
Asp Tyr Phe Cys He Ser Lys Trp Tyr Leu Lys Glu Val Lys Arg l£ 
440 445 X 450 

CCA TAT AAT TTA AAA TTG GGA TGT AAT TGG CAG TTT ATT CCT AAA GAT 
Pro Tyr Asn Leu Lys Leu Gly Cys Asn Trp Gin Phe He Pro Lys Asd 
455 460 4 6 5 

GAA GGG TGG ACT GAA TAAAAGCTTG CGGCCGCACT CGAG 

Glu Gly Trp Thr Glu 1547 
4 70 

12) INFORMATION FOR SEQ ID NO: 46: 



100 105 110 

He Arg lie Pro Lys Tyr Lys Asn Asp Gly lie Gin Asn Tyr lie His 
115 120 



125 



Asn Glu Tyr Thr lie He Asn Cys Met Lys Asn Asn Ser Gly Trp Lvs 

135 i 4 o 

lie Ser He Arg Gly Asn Arg lie He Trp Thr Leu He Asp He Asn 

150 155 160 

Gly Lys Thr Lys Ser Val Phe Phe Glu Tyr Asn He Arg Glu Asp He 
165 170 175 

Ser Glu Tyr He Asn Arg Trp Phe Phe Val Thr He Thr Asn Asn Leu 
180 185 i9o 

Asn Asn Ala Lys He Tyr He Asn Gly Lys Leu Glu Ser Asn Thr Asp 
195 200 205 

He Lys Asp lie Arg Glu Val He Ala Asn Gly Glu He lie Phe Lys 



220 



Leu Asp Gly Asp He Asp Arg Thr Gin Phe He Trp Met Lys Tvr Phe 
230 235 • 24Q 

Ser He Phe Asn Thr Glu Leu Ser Gin Ser Asn He Glu Glu Arq Tvr 
245 250 2S5 ' 

Lys He Gin Ser Tyr Ser Glu Tyr Leu Lys Asp Phe Trp Gly Asn Pro 
' 60 265 270 

Leu Met Tyr Asn Lys Glu Tyr Tyr Met Phe Asn Ala Gly Asn Lys Asn 

** /s 280 285 

Ser Tyr He Lys Leu Lys Lys Asp Ser Pro Val Gly Glu He Leu Thr 
" 295 3 0 o 

Arg Ser Lys Tyr Asn Gin Asn Ser Lys Tyr He Asn Tvr Arg Asp Leu 
* 310 3i 5 • 32Q 

Tyr He Gly Glu Lys Phe lie He Arg Arg Lys Ser Asn Ser Gin Ser 
325 330 335 

He Asn Asp Asp He Val Arg Lys Glu Asp Tyr He Tyr Leu Asp Pne 
340 345 3 5 o 

Phe Asn Leu Asn Gin Glu Trp Arg Val Tyr Thr Tyr Lys Tyr Phe ' vs 
355 360 365 

Lys Glu Glu Glu Lys Leu Phe Leu Ala Pro He Ser Asp Ser Asp Glu 
J u 375 380 

Phe Tyr Asn Thr He Gin He Lys Glu Tyr Asp Glu Gin Pro Thr Tyr 
385 390 395 400 

Ser Cys Gin Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr Asd Glu He 
405 410 " 415 

Gly Leu He Gly He His Arg Phe Tyr Glu Ser Gly He Val Phe Glu 
4 '° 125 430 

Glu Tyr Lys Asp Tyr Phe Cys He Ser Lys Trp Tyr Leu Lys Glu Val 
^ J b 440 445 

Lys Arg Lys Pro Tyr Asn Leu Lys Leu Gly Cys Asn Trp Gin Phe H<= 
qst - 455 460 

Pro Lys Asp Glu Gly Trp Thr Glu 
465 47 0 
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(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 31 base pairs 
<B} TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = M DNA" 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

CGCCATGGCT GATACAATAC TAATAGAAAT G 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

txi) SEQUENCE DESCRIPTION : SEQ ID NO:48: 

GCAAGCTTTT ATTCAGTCCA CCCTTCATC 

(2) INFORMATION FOR SEQ ID NO: 49: 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3753 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

<B) LOCATION: 1 . .3750 

txi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 

ATG CCA ACA ATT AAT AGT TTT AAT TAT AAT GAT CCT GTT AAT AAT AGA 
Met Pro Thr He Asn Ser Phe Asn Tyr Asn Asp Pro Val Asn Asn Arg 
1 5 * 10 15 

ACA ATT TTA TAT ATT AAA CCA GGC GGT TGT CAA CAA TTT TAT AAA TCA 
Thr lie Leu Tyr He Lys Pro Gly Gly Cys Gin Gin Phe Tyr Lys Ser 
20 25 30 

III ^ T t? T S TG ^ AAT ATT TGG ATA ATT CCA GAG AGA AAT GTA ATT 
Phe Asn lie Met Lys Asn lie Trp lie lie Pro Glu Arg Asn Val He 
■* 5 4 0 45 

, G ? T f CCC GAT TTT CTT CCG CCT ACT TCA TTG AAA AAT GGA 

Gly Thr He Pro Gin Asp Phe Leu Pro Pro Thr Ser Leu Lys Asn Gly 
j0 55 go 

GAT AGT AGT TAT TAT GAC CCT AAT TAT TTA CAA AGT GAT CAA GAA AAG 
Asp Ser Ser Tyr Tyr Asp Pro Asn Tyr Leu Gin Ser Asp Gin Glu Lvs 
65 70 75 go 

GAT AAA TTT TTA AAA ATA GTC ACA AAA ATA TTT AAT AGA ATA AAT GAT 
Asp Lys Phe Leu Lys He Val Thr Lys He Phe Asn Arg He Asn Asp 
8 $ 90 95 



31 



48 



96 



144 



19: 



240 



288 
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65 



AAT CTT TCA GGA AGG ATT TTA TTA GAA GAA CTG TCA AAA GCT AAT CCA 

Asn Leu ser Gly Arg lie Leu Leu Glu Glu Leu Ser Lys Ala Asn Pro 
100 105 no 

Itl Tin r? A ^ T n AT ACT CCA GAT GGT GAC TTC ATT A TT AAT GAT 

Tyr Leu Gly Asn Asp Asn Thr Pro Asp Gly Asp Phe lie He Asn Asp 

115 120 12 5 



a r*. /uw 111 ATT t-AA GAT CCT GCT CTT 
Arg Phe Lys Asp Asn Ser Met Asn Glu Phe lie Gin Asp Pro Ala Leu 
195 200 



336 



384 



GCA TCA GCA GTT CCA ATT CAA TTC TCA AAT GGT AGC CAA AGC ATA CTA 4^ 
10 Ala ser Ala Val Pro lie Gin Phe Ser Asn Gly Ser Gin Ser lie Leu ' 
iJU 135 

TTA CCT AAT GTT ATT ATA ATG GGA GCA GAG CCT GAT TTA TTT GAA ACT 
Leu Pro Asn Val lie lie Met Gly Ala Glu Pro Asp Leu Phe Glu Thr 
5 150 155 160 

AAC ACT TCC AAT ATT TCT CTA AGA AAT AAT TAT ATG CCA AGC AAT CAC 
Asn SeL Ser Asn lie Ser Leu Arg Asn Asn Tyr Met Pro Ser Asn His 
2() 165 170 175 

GGT TTT GGA TCA ATA GCT ATA GTA ACA TTC TCA CCT GAA TAT TCT TTT 
Gl/ Phe Gly Ser lie Ala lie Val Thr Phe Ser Pro Glu Tyr Ser Phe 
180 las igo 

AGA TTT AAA GAT AAT AGT ATG AAT GAA TTT ATT CAA GAT CCT GCT CTT 

*sp 
205 



480 



528 



576 



624 



720 



768 



816 



864 



M) Thf T A m TG 5 AT GAA T ™ ATA CAT TCA TTA CAT GGA CTA TAT GGG GCT 6 72 

Thr Leu Met His Glu Leu He His Ser Leu His Gly Leu Tyr Gly Ala 
" 10 215 220 

AAA GGG ATT ACT ACA AAG TAT ACT ATA ACA CAA AAA CAA AAT CCC CTA 
Lys Gly U e Thr Thr Lys Tyr Thr lie Thr Gin Lys Gin Asn Pro Leu 

230 235 240 

ATA ACA AAT ATA AGA GGT ACA AAT ATT GAA GAA TTC TTA AC^ TTT GGA 
He Thr Asn He Arg Gly Thr Asn He Glu Glu Phe Leu Thr Phe Gly 
4f, 245 250 255 

GGT ACT GAT TTA AAC ATT ATT ACT AGT GCT CAG TCC AAT GAT ATC TAT 
Gly Thr Asp Leu Asn He He Thr Ser Ala Gin Ser Asn Asp Hp Tvr 
260 265 270 " 

ACT AAT CTT CTA GCT GAT TAT AAA AAA ATA GCG TCT AAA CTT AGC AAA 
Thr Asn Leu Leu Ala Asp Tyr Lys Lys He Ala Ser Lys Leu Ser Lys 
275 280 285 

SO v T ? ^ w T ? TCT MT CCA CTA CTT AAT CCT TAT AAA GAT GTT TTT GAA 

/di 010 Vai Ser Asn p ro Leu Leu Asn Pro Tyr Lys Asp Val Phe Glu 
290 295 300 

GCA AAG TAT GGA TTA GAT AAA GAT GCT AGC GGA ATT TAT TCG GTA AAT 
Ala Lys Tyr Gly Leu Asp Lys Asp Ala Ser Gly lie Tyr Ser Val Asn 
JUb 310 315 320 

ATA AAC AAA TTT AAT GAT ATT TTT AAA AAA TTA TAC AGC TTT ACG GAA 
He Asn Lys Phe Asn Asp He Phe Lys Lys Leu Tyr Ser Phe Thr Glu 
60 325 33 0 335 

TTT GAT TTA GCA ACT AAA TTT CAA GTT AAA TGT AGG CAA ACT TAT ATT 
Phe Asp Leu Ala Thr Lys Phe Gin Val Lys Cys Arg Gin Thr Tyr He 
340 345 350 

GGA CAG TAT AAA TAC TTC AAA CTT TCA AAC TTG TTA AAT GAT TCT ATT 
Gly Gin Tyr Lys Tyr Phe Lys Leu Ser Asn Leu Leu Asn Asp Ser He 
355 360 365 



960 



1008 



1056 



1104 



70 ^ T rT A TCA GAA GGC TAT AAT ATA ^ AAT TTA AAG GTA AAT TTT 1152 

W lyr Asn He Ser Glu Gly Tyr Asn He Asn Asn Leu Lys Val Asn Phe 
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370 375 



380 



15 



AGA °? A CAG MT GCA TTA CCT AGA ATT ATT ACA CCA ATT ACA 
Arg Gly Gin Asn Ala Asn Leu Asn Pro Arg lie He Thr Pro He Thr 
3B5 390 



395 400 

GGT AGA GGA CTA GTA AAA AAA ATC ATT AGA TTT TGT AAA AAT ATT GTT 
Gly Arg Gly Leu Val Lys Lys He He Arg Phe Cys Lys Asn He Val 
|() 405 410 415 

151 ° TA GGC A ? A AGG AAA TCA ATA TGT ATC GAA ATA AAT AAT GGT 

Ser Val Lys Gly lie Arg Lys Ser He Cys lie Glu He Asn Asn Gly 
420 425 430 

GAG TTA TTT TTT GTG GCT TCC GAG AAT ACT TAT AAT GAT GAT AAT ATA 
Glu Leu Phe Phe Val Ala Ser Glu Asn Ser Tyr Asn Asp Asp Asn I 
435 440 445 

•>() ^ ^ T n CT *** GAA ATT GAC GAT ACA GTA ACT TCA AAT AAT AAT TAT 13 92 

A5n ™r Pro Lys Glu He Asp Asp Thr Val Thr Ser Asn Asn Asn Tyr 392 
450 455 460 

GAA AAT GAT TTA GAT CAG GTT ATT TTA AAT TTT AAT ACT GAA TCA GCA 
Glu Asn Asp Leu Asp Gin Val He Leu Asn Phe Asn Ser Glu Ser Ala 

" J Hbb 470 



47 5 480 

Pro I CA GAT GAA *** TTA ^ T TTA ACT ATC CAA AAT GAT GCT 

Pro Gly Leu Ser Asp Glu Lys Leu Asn Leu Thr lie Gin Asn Aso Ala 

30 485 490 495 

'fvr jit d CA Z AT GAT TCT ^ T GGA ACA AGT GAT ATA GAA CAA CAT 

ryr He Pro Lys Tyr Asp Ser Asn Gly Thr Ser Asp He Glu Gin His 

500 505 510 

35 



630 6 35 



640 



TTA ATT CCT ACA ATT TTA GTA TTC ACG ATA AAA TCT TTT TTA GGT TCA 
Leu lie Pro Thr lie Leu Val Phe Thr lie Lys Ser Phe llu Gly Ser 
70 645 650 655 



1200 



1248 



1296 



1344 



1440 



1488 



1536 



1584 



a™ STT ^ ^ ? TT MT G ™ TTT TTC TAT TTA GAT GCA CAG AAA GTG 
Asp Val Asn Glu Leu Asn Val Phe Phe .Tyr Leu Asp Ala Gin Lvs Val 

515 520 525 

40 o CC ^ G ^ GTC AAT CTC ACC TC T TCA ATT GAT ACA GCA 16 32 

40 Pro Glu Gly Glu Asn Asn Val Asn Leu Thr Ser Ser He Asp Thr Ala 

530 535 540 

TTA TTA GAA CAA CCT AAA ATA TAT ACA TTT TTT TCA TCA GAA TTT ATT 
4S c!c LeU ° 1U Gln Pro L ^ S Ile T ^ Thr Phe Phe Ser Ser Glu Phe He 

550 555 5 6 o 

AAT AAT GTC AAT AAA CCT GTG CAA GCA GCA TTA TTT GTA AGC TGG ATA 
Asn Asn Val Asn Lys Pro Val Gin Ala Ala Leu Phe Val Ser Trp lie 
5() 565 570 575 

G^ rft SI? J TA GAT " T ACT ACT GAA GCT MC CAA AAA AGT ACT 

Gin Gin Val Leu Val Asp Phe Thr Thr Glu Ala Asn Gin Lys Ser Thr 

580 585 590 

" SIT a AT ?T T G ? A GAT ATT TCT ATA GTT GTT CCA TAT ATA GGT CTT 

Val Asp Lys Ile Ala Asp Ile Ser Ile Val Val Pro Tyr Ile Gly Leu 
595 600 605 

AO ^ I™ ^ T A T A GGA ^ GAA GCA CAA AAA GGA AAT TTT AAA GAT GCA 18 7' 

60 Ala Leu Asn lie Gly Asn Glu Ala Gin Lys Gly Asn Phe Lvs Asp Ala 

610 615 6 20 

CTT GAA TTA TTA GGA GCA GGT ATT TTA TTA GAA TTT GAA CCC GAG CTT 
fi5 Leu Glu Leu Leu Gly Ala Gly He Leu Leu Glu Phe Glu Pro Glu Leu 



L680 



1728 



1776 



1824 



1920 



1968 
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45 



III a" AA T AAA AAT AAA GTT ATT AAA GCA ATA AAT AAT GCA TTG AAA 
Ser Asp ash Lys Asn Lys Val lie Lys Ala lie Asn Asn Ala Leu J£ 
660 665 



670 



TAT CAA GCT TTA CAA AAT CAA GTA AAT GCA CTT AAA GCA ATA ata raa 
Tyr Gin Ala Leu Gin Asn Gin Val Asn Ala lIu J£ lie lie Su" 

705 710 715 ^ biu 

Zl ™° 1* T *»* AGT TAT ACT TTA GAA GAA AAA AAT GAG CTT ACA AAT 
Ser Lys Tyr Asn Ser Tyr Thr Leu Glu Glu Lys Asn Glu Leu Thr A^n 
725 7 30 735 

AAA TAT GAT ATT GAG CAA ATA GAA AAT GAA CTT AAT CAA AAG GTT TCT 
Lys Tyr Asp lie Glu Gin lie Glu Asn Glu Leu Asn Gin Ly"s Val Ser 
740 745 150 

ATA GCA ATG AAT AAT ATA GAC AGG TTC TTA ACT GAA AGT TCT ATA TCT 
He AJa Met Asn Asn He Asp Arg Phe Leu Thr Glu Ser Ser Ue Sr 
755 760 765 



TAT GAT GAA AAT GTT AAA ACG TAT TTA TTA GAT TAT ATT ATA AAA CAT 
Tyr Asp Glu Asn Val Lys Thr Tyr Leu Leu Asp Tyr lie He L^ Hxl 

790 795 8 oo 

GGA TCA ATC TTG GGA GAG AGT CAG CAA GAA CTA AAT TCT ATG GTA ATT 
Gly Ser He Leu Gly Glu Ser Gin Gin Glu Leu Asn Ser Met Val III 
40 805 810 815 

GAT ACC CTA AAT AAT AGT ATT CCT TTT AAG CTT TCT TCT TAT ACA GAT 
Asp Thr Leu Asn Asn Ser He Pro Phe Lys Leu Ser Ser Tyr Thr A^d 
820 825 8 3 0 

Asn rT T 7 TA * A T T TCA TAT TTT *** ™ G TTC TTT AAG AGA ATT AAA 

Asp L yS He Leu He Ser Tvr Phe Asn Lys Phe Phe Lys Arq He Lys 

bib 840 _ . _ - 7 



845 



2016 



2064 



5 Glu Am ^ T ^ ^ TGG *** GAA GTA TAT AGT TTT ft TA GTA TCG AAT 

Glu Aig Asp Glu Lys Trp Lys Glu Val Tyr Ser Phe lie Val Ser Asn 
675 680 685 

TGG ATG ACT AAA ATT AAT ACA CAA TTT AAT AAA AGA AAA GAA CAA ATP 
10 Trp Met Thr Lys He Asn Thr Gin Phe Asn Lys Arg JJJ g^ £J 2? 

695 700 



2160 



2208 



2256 



2304 



M) 7™ ATG *** TTA ATA ^ GAA GTA AAA ATT AAT AAA TTA AGA GAA n« 

-'0 Tyr Leu Met Lys Leu He Asn Glu Val Lys He Asn Lys Leu Arg 

U 775 78o 



2400 



2448 



2496 



2544 



SO I CT GT T TTA ^T ATG AGA TAT AAA AAT GAT AAA TAC GTA GAT 2 5 9^ 

B^o ^ ASn Met Al " g Tyr Lys Asn Asp Lys Tyr Val Asp 

855 860 



ThI £r r? l T I CA AAT ATA AAT ATT *»* GGA GAT GTA TAT AAA 

55 «S V ' yr ASP Ser Asn Ile Asn Ile Asn G1 V A SP Val Tyr Lys 

865 870 875 e Y g 0 

d CA A P AAT *** *** CAA TTT GGA ATA TAT AAT GAT AAA CTT AGT 
Tyr Pro Thr Asn Lys Asn Gin Phe Gly He Tyr Asn Asp Lys Leu Ser 

6() 885 890 895 

r^l ITT ^ A T A TCT CAA AAT GAT TAC ATT ATA TAT GAT AAT AAA TAT 
Glu Val Asn He Ser Gin Asn Asp Tyr He Ile Tyr Asp Asn Lys Tyr 
900 905 910 

65 AAA AAT TTT AGT ATT AGT TTT TGG GTA AGA ATT CCT AAC TAT GAT AAT 

Lys Asn Phe Ser lie Ser Phe Trp Val Arg He Pro Asn Tyr Asp i£n 

91 = 920 925 

70 ?So ^ T GT T GAA TAC ACT ATA ATA AAT TGT ATG AGG 283<> 

Lys He Val Asn Val Asn Asn Glu Tyr Thr He He Asn Cys Met Arg 
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2640 



2688 



2736 



2784 
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930 93S 



940 



a=I ^ ^ T TCA GGA TGG *** GTA TCT CTT AAT CAT AAT GAA ATA ATT 
5 94? ASn ASn Ser Gly Tr P ^ Val Ser Leu Asn His Asn Glu lie nl 

950 955 gee 

^ro Thr 73!! GAT AAT TCA GGA ATT ^ CAA TTA GCA TTT AAC 

Trp Thr Leu Gin Asp Asn Ser Gly He Asn Gin Lys Leu Ala Phe Asn 

|() 965 970 975 

Tvr r? T AAC ^ A ^ GGT ATT TCT GAT TAT ATA AAT AAG TGG ATT TTT 
lyr Gly Asn Ala Asn Gly He Ser Asp Tyr lie Asn Lys Trp He Phe 
980 9B5 990 

15 Sit Th T tT A ^ GAT AGA ™ GGA GAT TCT AAA CTT TAT ATT AAT 

Val Thr lie Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr lie J£ 
995 1000 100 5 

■>0 G?v tT A GAT *** *** TCA ATT TTA AAT TTA GGT AAT ATT CAT 

Gly Asr^Leu lie Asp Lys Lys^Ser He Leu Asn Leu^Gly Asn lie His 

GTT ACT GAC AAT ATA TTA TTT AAA ATA GTT AAT TGT ACT TAT ACA AGA 

■>5 io's P IiG ^ Phe LyS Ue Val Asn C V S Ser Tyr Thr Arg 

1030 1035 1040 

lv~ ■ II CW t7 AGA ^ T 17 *** ATT TTT GAT *** CAA TTA GAT GAA 
TV He Gly He Arg Tyr Phe Asn lie Phe Asp Lys Glu Leu Asp Glu 

;,() 1045 1050 ioS5 

ctC m I TH T T A ^ T AAC AAT GAA CCT GCA AAT ATT TTA 

■ nr Glu He Gin Thr Leu Tyr Asn Asn Glu Pro Asn Ala Asn He Leu 

1060 1065 1070 

35 AAG GAT TTT TGG GGA AAT TAT TTG CTT TAT GAC AAA GAA TAC TAT TTA 

Lys Asp Phe Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tvr lit 
1075 1080 108 5 

40 Leu £n" SI? 71* n CA MT MC TTT ATT *** AGG AGA ACA CAT TCT 3312 

tnan eU LyS Pr ° Asn Asn Phe lle Asn ft rg Arg Thr Asp Ser 

1090 1095 iioo 

ACT TTA AGC ATT AAT AAT ATA AGA AGC ACT ATT CTT TTA GCT AAT AGA 
Thr Leu Ser He Asn Asn He Arg Ser Thr He Leu Leu Ala Asn Arg 

mo n 15 

leu Tvl ^ r? A rT A *** GT T *** A ™ CAA ACA GTT AAT AAT ACT ACT 
Leu Tyi Ser Gly He Lys Val Lys lie Gin Arg Val Asn Asn Ser Ser 

5() 1125 H J0 1135 

ThI AAC ^ T AAT CTT GTT AGA AAG MT GAT CA G GTA TAT ATT AAT TTT 
Thr Asn Asp Asn Leu Val Arg Lys Asn Asp Gin Val Tyr He Asn Pne 

1140 H45 iiso 

33 vll A?^ AGC ?** ^ T CAC TTA CTT CCA TTA TAT GCT GAT ACA GCT ACC 

Val Ala Ser Lys Thr His Leu Leu Pro Leu Tyr Ala Asp Thr Ala Thr 
1155 . HBO 1165 

£ tzjz sk £ s s: # ?s s s s & s e 3552 

U 1175 iiao 

AAT CAA GTA GTA GTT ATG AAT TCA GTA GGA TGT ACA ATG AAT TTT AAA 



2880 



2928 



2976 



3024 



3072 



3120 



3168 



3216 



3264 



3360 



3408 



3456 



3504 



3600 



1190 1195 



1200 
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60 



Asp Ser Ser Tyr Tyr Asp Pro Asn Tyr Leu Gin Ser Asp Gin Glu Lys 
65 70 75 80 

Asp Lys Phe Leu Lys lie Val Thr Lys He Phe Asn Arg lie Asn Asp 
85 90 95 

Asn Leu Ser Gly Arg lie Leu Leu Glu Glu Leu Ser Lys Ala Asn Pro 
100 105 no 

Tyr Leu Gly Asn Asp Asn Thr Pro Asp Gly Asp Phe lie lie Asn Asp 
H5 120 125 

Ala Ser Ala Val Pro lie Gin Phe Ser Asn Gly Ser Gin ser lie Leu 
130 135 140 

Leu Pro Asn Val lie lie Met Gly Ala Glu Pro Asp Leu Phe Glu Thr 
145 150 155 160 

Asn Ser Ser Asn He Ser Leu Arg Asn Asn Tyr Met Pro Ser Asn His 
165 170 175 

Gly Phe Gly Ser lie Ala lie Val Thr Phe Ser Pro Glu Tyr Ser Phe 
180 185 190 

Arg Phe Lys Asp Asn Ser Met Asn Glu Phe He Gin Asp Pro Ala Leu 
195 200 205 

Thr Leu Met His Glu Leu He His Ser Leu His Gly Leu Tyr Gly Ala 
210 215 220 

Lys Gly He Thr Thr Lys Tyr Thr He Thr Gin Lys Gin Asn Pro Leu 



3648 



3696 



3744 



AAT AAT AAT GGA AAT AAT ATT GGG TTG TTA GGT TTC AAG GCA GAT ACT 
Asn Asn Asn Gly Asn Asn lie Gly Leu Leu Glv Phe Lys Ala Asp Thr 
1205 1210 * 1.215 

GTA GTT GCT ACT ACT TGG TAT TAT ACA CAT ATG AGA GAT AAT ACA AAC 
Val Val Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp Asn Thr 
1220 1225 1230 

£r £n IT" T l T TG ° ^ ^ ATT TCT GAA GAA CAT GGA TGG CAA 

Ser Asn Gly Phe Phe Trp Asn Phe lie Ser Glu Glu His Gly Trp Gin 
1235 1240 1245 

GAA AAA TAA 

Glu Lys 37 53 
1250 

(2) INFORMATION FOR SEQ ID NO: 50: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 50 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 i nea r 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Met Pro Thr He Asn Ser Phe Asn Tyr Asn Asp Pro Val Asn Asn Arg 

Thr He Leu Tyr He Lys Pro Gly Gly Cys Gin Gin Phe Tyr Lys Ser 
20 25 30 

Phe Asn He Met Lys Asn He Trp He He Pro Glu Arg Asn Val He 
35 40 45 

Gly Thr lie Pro Gin Asp Phe Leu Pro Pro Thr Ser Leu Lys Asn Glv 
50 55 7 
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225 

He Thr Asn He 



10 



20 



30 



40 



45 



50 



60 



65 



70 



Gly Thr Asp Leu 
260 

Thr Asn Leu Leu 
275 

Val Gin Val Ser 
290 

Ala Lys Tyr Gly 
305 

He Asn Lvs Phe 



Phe Asp Leu Ala 

340 

Gly Gin Tyr Lys 

3 5 5 

Tyr Asn He Ser 
370 

Arg Gly Gin Asn 
385 

Gly Arg Gly Leu 



Ser Val Lys Gly 
420 

Glu Leu Phe Phe 
435 

Asn Thr Pro Lys 
450 

Glu Asn Asp Leu Asp Gin 
465 47 0 



230 

Arg Gly 
245 

Asn He 

Ala Asp 

Asn Pro 

Leu Asp 
310 

Asn Asp 
325 

Thr Lys 

Tyr Phe 

Glu Gly 

Ala Asn 
390 

Val Lys 
405 

He Arg 
Val Ala 
Glu He 



Thr Asn 

lie Thr 

Tyr Lys 
280 

Leu Leu 
295 

Lys Asp 
He Phe 
Phe Gin 



Lys Leu 
360 

Tyr Asn 
375 

Leu Asn 



Lys He 
Lys Ser 



Ser Glu 
440 



Asp Asp 
455 



235 

He Glu Glu 
250 

Ser Ala Gin 
265 

Lys He Ala 

Asn Pro Tyr 

Ala Ser Gly 
315 

Lys Lys Leu 
330 

Val Lys Cys 
345 

Ser Asn Leu 

He Asn Asn 

Pro Arg He 
395 

He Arg Phe 
410 

He Cys He 
425 

Asn Ser Tyr 
Thr Val Thr 



Phe 



Ser 



Ser 



Lys 
300 



240 

Leu Thr Phe Gly 
255 

Asn Asp lie Tyr 
270 

Lys Leu Ser Lys 
285 

Asp Val Rhe Glu 



lie 

Xyr 

Arg 
Leu 



Leu 
380 



Tyr Ser Val Asn 
320 

Ser Phe Thr Glu 
335 

Gin Thr Tyr He 
350 

Asn Asp Ser lie 
365 

Lys Val Asn Phe 



He Thr Pro He Thr 
400 

Cys Lys Asn lie Val 
415 

Glu lie Asn Asn Gly 
430 

Asn Asp Asp Asn lie 
445 

Asn Asn Asn Tvr 



Pro Gly Leu Ser 



Tyr lie Pro Lys 
500 

Asp Val Asn Glu 
515 

Pro Glu Gly Glu 
530 

Leu Leu Glu Gin 
54 5 

Asn Asn Val Asn 



Gin Gin Val Leu 
580 

Val Asp Lys lie 
595 



Asp Glu 
485 

Tyr Asp 

Leu Asn 

Asn Asn 

Pro Lys 
550 

Lys Pro 
565 

Val Asp 
Ala Asp 



Val He 

Lys Leu 

Ser Asn 

Val Phe 
520 

Val Asn 
535 

lie Tyr 
Val Gin 
Phe Thr 



Leu Asn Phe 
475 

Asn Leu Thr 
490 

Gly Thr Ser 
505 

Phe Tyr Leu 
Leu Thr Ser 



Ser 
460 

Asn 



Thr Phe Phe 
555 

Ala Ala Leu 
570 

Thr Glu Ala 
585 



He 
Asp 
Asp 



Ser 
540 

Ser 



Ser Glu Ser Ala 
480 

Gin Asn Asp Ala 
495 

lie Glu Gin His 
510 

Ala Gin Lys Val 
525 

lie Asp Thr Ala 



Phe 



Asn 



lie Ser 
600 



He Val Val Pro 



Ser Glu Phe lie 
560 

Val Ser Trp He 
575 

Gin Lys Ser Thr 
590 

Tyr lie Gly Leu 
605 
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Ala Leu Asn He Gly Asn Glu Ala Gin Lys Gly Asn Phe Lys Asp Ala 



620 



Leu Glu Leu Leu Gly Ala Gly He Leu Leu Glu Phe Glu Pro Glu Leu 
630 63S 640 

Leu He Pro Thr lie Leu Val Phe Thr lie Lys Ser Phe Leu Gly Ser 
645 650 65 £ 

ser Asp Asn Lys Asn Lys Val lie Lys Ala He Asn Asn Ala Leu Lys 
660 6 65 gVO 

Glu Arg Asp Glu Lys Trp Lys Glu Val Tyr Ser Phe lie Val Ser Asn 
675 660 685 

Trp Met Thr Lys He Asn Thr Gin Phe Asn Lys Arg Lys Glu Gin Met 

695 700 

Tyr Gin Ala Leu Gin Asn Gin Val Asn Ala Leu Lys Ala He He Glu 

710 715 720 

Ser Lys Tyr Asn Ser Tyr Thr Leu Glu Glu Lys Asn Glu Leu Thr Asn 
725 730 735 

Lys Tyr Asp He Glu Gin He Glu Asn Glu Leu Asn Gin Lys Val Ser 
740 745 750 

He Ala Met Asn Asn He Asp Arg Phe Leu Thr Glu Ser Ser He Ser 
755 760 76S 

Tyr Leu Met Lys Leu He Asn Glu Val Lys lie- Asn Lys Leu Arg Glu 
u 775 790 

Tyr Asp Glu Asn Val Lys Thr Tyr Leu Leu Asp Tyr He He Lys His 
85 790 795 soo 

Gly Ser He Leu Gly Glu Ser Gin Gin Glu Leu Asn Ser Met Val lie 
805 810 81S 

Asp Thr Leu Asn Asn Ser He Pro Phe Lys Leu Ser Ser Tvr Thr Asd 
820 825 830 

Asp Lys lie Leu He Ser Tyr Phe Asn Lys Phe Phe Lys Arg He Lys 
835 840 845 

Ser Ser Ser Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tvr Val Asp 

asv 855 - r 



860 



Thr Ser Gly Tyr Asp Ser Asn lie Asn lie Asn Gly Asp Val Tyr Lys 
865 870 875 aao 

Tyr Pro Thr Asn Lys Asn Gin Phe Gly He Tyr Asn Asp Lys Leu Ser 
885 890 895 

Glu Val Asn He Ser Gin Asn Asp Tyr lie lie Tyr Asp Asn Lys Tyr 
900 905 910 

Lys Asn Phe Ser lie Ser Phe Trp Val Arg lie Pro Asn Tyr Asp Asn 
915 920 925 

Lys lie Val Asn Val Asn Asn Glu Tyr Thr lie lie Asn Cys Met Arg 
930 935 940 9 

Asp Asn Asn Ser Gly Trp Lys Val Ser Leu Asn His Asn Glu He He 
945 950 955 960 

Trp Thr Leu Gin Asp Asn Ser Gly He Asn Gin Lys Leu Ala Phe Asn 
965 970 975 

Tyr Gly Asn Ala Asn Gly lie Ser Asp Tyr He Asn Lys Trp He Phe 
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980 985 990 

Val Thr He Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr He Asn 
995 1000 1005 

Cly Asn Leu He Asp Lys Lys Ser He Leu Asn Leu Gly Asn He His 
1010 1015 1020 

Val Ser Asp Asn He Leu Phe Lys He Val Asn Cys Ser Tyr Thr Ara 
1025 1030 10 35 10 40 

Tyr He Gly He Arg Tyr Phe Asn He Phe Asp Lys Glu Leu Asp Glu 
1045 1050 1055 

Thr Glu He Gin Thr Leu Tyr Asn Asn Glu Pro Asn Ala Asn He Leu 
1060 1065 1070 

Lys Asp Phe Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tyr Leu 
1075 1080 1085 

LeU ?n^ Val LeU LyS Pr ° Asn Asn Phe Ile Asn Arg Arg Thr Asp Ser 
1090 1095 iioo 

Thr Leu Ser Ile Asn Asn Ile Arg Ser Thr He Leu Leu Ala Asn Arg 
1105 1110 H15 1120 

Leu Tyr Ser Gly Ile Lys Val Lys He Gin Arg Val Asn Asn Ser Ser 
ll 25 1130 H35 

Thr Asn Asp Asn Leu Val Arg Lys Asn Asp Gin Val Tyr Ile Asn Phe 
1140 H45 H50 

Val Ala Ser Lys Thr His Leu Leu Pro Leu Tyr Ala Asp Thr Ala Thr 
1155 H60 H65 

Thr Asn Lys Glu Lys Thr He Lys He Ser Ser Ser Gly Asn Arg Phe 
1170 H75 neo 

Asn Gin Val Val Val Met Asn Ser Val Gly Cys Thr Met Asn Phe Lys 
1185 1190 1195 1200 

Asn Asn Asn Gly Asn Asn Ile Gly Leu Leu Gly Phe Lys Ala Asp Thr 
1205 1210 1215 

Val Val Ala Ser Thr Trp Tyr Tyr Thr His Mec Arg Asp Asn Thr Asn 
12 20 1225 1230 

Ser Asn Gly Phe Phe Trp Asn Phe Ile Ser Glu Glu His Gly Trp Gin 
1235 1240 1245 

Glu Lys 
1250 

(2) INFORMATION FOR SEQ ID NO: 51: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 759 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

fii) MOLECULE TYPE : DNA (genomic) 

(ix> FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .3756 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
ATG CCA AAA ATT AAT AGT TTT AAT TAT AAT GAT CCT GTT AAT GAT AGA 
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Met Pro Lys He Asn Ser Phe Asn Tyr Asn Asp Pro Val Asn Asp Arg 
5 10 15 

ACA ATT TTA TAT ATT AAA CCA GGC GGT TGT CAA GAA TTT TAT AAA TCA 
Thr lie Leu Tyr He Lys Pro Gly Gly Cys Gin Glu Phe Tyr Lys Ser 
20 25 30 

III £1 tT T £ TG AAT ATT TGG ATA ATT CCA GAG AGA AAT GTA ATT 

Phe Asn lie Met Lys Asn He Trp lie He Pro Glu Arg Asn Val lie 
35 40 45 

GGT ACA ACC CCC CAA GAT TTT CAT CCG CCT ACT TCA TTA AAA AAT CCA 
Gly Thr Thr Pro Gin Asp Phe His Pro Pro Thr Ser Leu Lys Gly 

55 60 

GAT AGT ACT TAT TAT GAC CCT AAT TAT TTA CAA AGT GAT GAA GAA AAG 
Asp ser Ser Tyr Tyr Asp Pro Asn Tyr Leu Gin Ser Asp ™ 

70 75 go 

t*l Ik T TTA *** ATA GTC ACA *** ATA TTT AAT AGA ATA AAT AAT 

Asp Arg Phe Leu Lys He Val Thr Lys lie Phe Asn Arg lie iSn A^n 
85 90 95 

AAT CTT TCA GGA GGG ATT TTA TTA GAA GAA CTG TCA AAA GCT AAT ~rh 
Asn Leu Ser Gly Gly lie Leu Leu Glu Glu Leu Ser Lys All J£ Pro 
100 105 110 

T-r 71* n? G AAT GAT AAT ACT CCA GAT MT GAA TTC CAT ATT GGT GAT 
Tyr Leu Gly Asn Asp Asn Thr Pro Asp Asn Gin Phe His He Gly Asp 

120 125 

Ma Ser a£ SIT rV *** TTC TCA ^ T GGT AGC CAA GAC "A CTA 

Ala ser Ala Val Glu lie Lys Phe Ser Asn Gly Ser Gin Asp lie Leu 

135 140 

TTA CCT AAT GTT ATT ATA ATG GGA GCA GAG CCT GAT TTA TTT GAA ACT 
^ "° Asn Vai He Met Gly Ala Glu Pro Aso Leu Phe Glu Thr 

145 150 155 ' 160 

A^n Zl l CC ^ T ,7 TCT CTA AGA AAT *** TAT ATG C ^ AGC AAT CAC 
Asn Ser Ser Asn He Ser Leu Arg Asn Asn Tyr Met Pro Ser Asn His 

165 170 175 

GGT TTT GGA TCA ATA GCT ATA GTA ACA TTC TCA CCT GAA TAT TC~ TTT 
«ly Phe Gly ser He Ala He Val Thr Phe Ser Pro Glu Tyr Ser "he 
180 185 190 

A™ dk T ^ ? AT AAT AGT ATG AAT GAA TTT ATT CAA GAT CCT GCT CTT 
Arg Phe Asn Asp Asn Ser Met Asn Glu Phe He Gin Asp Pro Ala Leu 
195 200 205 

Thr l I» S AT 2** TTA A ™ ° AT TCA TTA CAT GGA CTA TAT GGG GCT 

Thr Leu Met His Glu Leu He His Ser Leu His Gly Leu Tyr Gly Ala 

2X5 220 

AAA GGG ATT ACT ACA AAG TAT ACT ATA ACA CAA AAA CAA AAT CCC CTA 
Lys Gly He Thr Thr Lys Tyr Thr He Thr Gin Lys Gin A^n Pro 

230 235 240 

lie Thr A^n tT A £ GA S? T A ^ A AAT ATT GAA GAA TTC TTA ACT TTT GGA 
He rhi Asn He Arg Gly Thr Asn He Glu Glu Phe Leu Thr Phe Gly 

245 250 255 

6S r?3 £f T ^ TTA AAC ATT ATT ACT AGT GCT CAG T ^C AAT GAT ATC TAT 816 

M Gly Thr Asp Leu Asn He lie Thr Ser Ala Gin Ser Asn Asp lie Tyr 

260 265 270 



.10 



40 



50 



60 



96 



144 



192 



240 



268 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



70 



ACT AAT CTT CTA GCT GAT TAT AAA AAA ATA GCG TCT AAA CTT AGC AAA 
Thr Asn Leu Leu Ala Asp Tyr Lys Lys He Ala Ser Lvs Leu Ser lCJ 
275 280 285 



864 
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GTA CAA 
Val Gin 
290 

GCA AAG 
Ala Lys 
305 

ATA AAC 
He Asn 



GTA TCT AAT 
Val Ser Asn 



TAT GGA TTA 
Tyr Gly Leu 



AAA- TTT 
Lys Phe 



TTT GAT 
Phe Asp 



GGA CAG 
Gly Gin 



TAT 
Tyr 



AGA 
Arg 
385 

GGT 
Gly 



AAT 
Asn 

370 

GGA 
Gly 



TTA GCA 
Leu Ala 
340 

TAT AAA 
Tyr Lys 
355 

ATA TCA 
He Ser 



AAT 
Asn 
325 

ACT 
Thr 



CCA CTA 
Pro Leu 
295 

GAT AAA 
Asp Lys 
310 

GAT ATT 
Asp He 



CTT AAT CCT 
Leu Asn Pro 



GAT GCT AGC 
Asp Ala Ser 



AAA TTT 
Lys Phe 



TAC 
Tyr 



GAA 
Glu 



CAG AAT GCA 
Gin Asn Ala 



AGA 
Arg 



TCT GTA 

Ser Val 



GAG TTA 
Glu Leu 



AAT ACT 
Asn Thr 
450 

GAA AAT 
Glu Asn 
465 

CCT GGA 
Pro Gly 

TAT ATA 
Tyr He 



GGA CTA GTA 
Gly Leu Val 
405 

AAA GGC ATA 
Lys Gly lie 
420 

TTT TTT GTG 
Phe Phe Val 
435 

CCT AAA GAA 
Pro Lys Glu 



TTC AAA 
Phe Lys 

GGC TAT 

Gly Tyr 
375 

AAT TTA 
Asn Leu 
390 

AAA AAA 

Lys Lys 



TTT AAA AAA 
Phe Lys Lys 
330 

CAA GTT AAA 
Gin Val Lys 
345 

CTT TCA AAC 
Leu Ser Asn 
360 

AAT ATA AAT 
Asn He Asn 



TAT AAA GAT GTT 
Tyr Lys Asp Val 
300 

GGA ATT TAT TCG 
Gly He Tyr Ser 
315 

TTA TAC AGC TTT 
Leu Tyr Ser Phe 



TTT GAA 
Phe Glu 



TGT AGG 
Cys Arg 



TTG TTA 
Leu Leu 



AAT CCT AGA 
Asn Pro Arg 



AGG AAA 
Arg Lys 



GCT TCC 
Ala Ser 



GAT TTA GAT 
Asp Leu Asp 



CT' 7 * 
Leu 



CCA 
Pro 



GAT GTT 
Asp Val 



CCC GAA 
Pro Glu 
530 



AAT 
Asn 
515 

GGT 
Glv 



TCA GAT 
Ser Asp 
485 

AAA TAT 
Lys Tyr 
500 

GAA CTT 
Glu Leu 



ATT GAC 
He Asp 
455 

CAG GTT 
Gin Val 
470 

GAA AAA 
Glu Lys 



ATC ATT AGA 
He He Arg 
410 

TCA ATA TGT 
Ser He Cys 
425 

GAG AAT AGT 
Glu Asn Ser 
440 

GAT ACA GTA 
Asp Thr Val 



AAT TTA 
Asn Leu 
380 

ATT ATT 
He He 
395 

TTT TGT 

Phe Cyj 



CAA ACT 
Gin Thr 
350 

AAT GAT 
Asn Asp 
365 

AAG GTA 
Lys Val 



GTA AAT 
Val Asn 
320 

ACG GAA 
Thr Glu 
335 

TAT ATT 
Tyr He 



TCT ATT 
Ser He 



AAT TTT 
Asn Phe 



ACA CCA 
Thr Pro 



AAA AAT 
Lys Asn 



ATC 
He 



TAT 
Tyr 



GAA 
Glu 



AAT 
Asn 



ATA AAT 
He Asn 
430 

GAT GAT 
Asp Asp 
445 



ATT ACA 
He Thr 
400 

ATT GTT 
He Val 
415 

AAT GGT 
Asn Gly 



AAT ATA 
Asn He 



ACT 
Thr 



ATT TTA AAT 
He Leu Asn 



TTA 
Leu 



GAT TCT 
Asp Ser 



AAT GTA 
Asn Val 



GAA AAT 
Glu Asn 



TTA TTA 
Leu Leu 
545 

AAT AAT 
Asn Asn 



GAA CAA CCT 
Glu Gin Pro 



GTC AAT AAA 
Val Asn Lys 



AAT GTC 
Asn Val 
535 

AAA ATA 
Lys He 
550 

CCT GTG 
Pro Val 



AAT 
Asn 



TTT 
Phe 
520 

AAT 
Asn 



AAT TTA 
Asn Leu 
490 

GGA ACA 
Gly Thr 
505 

TTC TAT 
Phe Tyr 



TTT 
Phe 
475 

ACT 
Thr 



AGT 
Ser 



TCA 
Ser 
460 

AAT 
Asn 



AAT AAT 
Asn Asn 



AGT GAA 
Ser Glu 



ATC CAA AAT 
He Gin Asn 



GAT 
Asp 



TTA 
Leu 



CTC ACC 
Leu Thr 



TAT ACA TTT 
Tyr Thr Phe 



CAA GCA GCA 
Gin Ala Ala 



TCT 
Ser 



TTT 
Phe 
555 

TTA 
Leu 



GAT 
Asp 



TCA 
Ser 
540 

TCA 
Ser 



ATA GAA 
lie Glu 
510 

GCA CAG 
Ala Gin 
525 

ATT GAT 
He Asp 



AAT TAT 
Asn Tyr 

TCA GCA 
Ser Ala 
480 

GAT GCT 
Asp Ala 
495 

CAA CAT 
Gin His 



AAA GTG 
Lys Val 



ACA GCA 
Thr Ala 



TCA GAA 
Ser Glu 



TTT GTA AGC 
Phe Val Ser 



TTT ATT 
Phe He 
560 

TGG ATA 
Trp He 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



1344 



1392 



1440 



148B 



1536 



1584 



1632 



1680 



1728 
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565 570 



575 



CAA CAA GTG TTA GTA GAT TTT ACT ACT GAA GCT AAC CAA AAA ACT ACT 
Gin Gin Val Leu Val Asp Phe Thr Thr Glu Ala Asn Gin Lys Ser Thr 
580 585 5 9 o 

GTT GAT AAA ATT GCA GAT ATT TCT ATA GTT GTT CCA TAT ATA GGT CTT 
Val Asp Lys He Ala Asp lie Ser He Val Val Pro Tyr lie Gly Leu 
)0 595 600 605 

OCT TTA AAT ATA GGA AAT GAA GCA CAA AAA GGA AAT TTT AAA GAT GCA 
Ala Leu Asn lie Gly Asn Glu Ala Gin Lys Gly Asn Phe Lys Asp Ala 

biU 615 620 



30 



50 



65 



70 



TCT GAT AAT AAA AAT AAA GTT ATT AAA GCA ATA AAT AAT GCA TTG AAA 
Ser Asp Asn Lys Asn Lys Val lie Lys Ala He Asn Asn Ala Leu Lys 
660 665 6 70 

GAA AGA GAT GAA AAA TGG AAA GAA GTA TAT AGT TTT ATA GTA TCG AAT 
Glu Arg Asp Glu Lys Trp Lys Glu Val Tyr Ser Phe lie Val Ser Asn 
675 680 6 85 

mI? £f T tT T AAT ACA CAA TTT AAT AAA AGA AAA GAA CAA ATG 

Trp Met Thr Lys lie Asn Thr Gin Phe Asn Lys Arg Lys Glu Gin Met 
690 695 700 

TAT CAA GCT TTA CAA AAT CAA GTA AAT GCA ATT AAA ACA ATA ATA GAA 
Tyr Gin Ala Leu Gin Asn Gin Val Asn Ala lie Lys Thr He He Glu 
705 710 * 715 7^0 



TAT GAT GAG AAT GTC AAA ACG TAT TTA TTG AAT TAT ATT ATA CAA CAT 
Tyr Asp Glu Asn Val Lys Thr Tyr Leu Leu Asn Tyr He He Gin His 
785 79 ° 795 -800 



GAT ACC CTA AAT AAT AGT ATT CCT TTT AAG CTT TCT TCT TAT ACA GAT 

Asp Thr Leu Asn Asn Ser He Pro Phe Lys Leu Ser Ser Tyr Thr Asp 
820 625 B30 

GAT AAA ATT TTA ATT TCA TAT TTT AAT AAA TTC TTT AAG AGA ATT AAA 

Asp Lys He Leu He Ser Tyr Phe Asn Lys Phe Phe Lys Arg He Lys 

83 $ 840 845 



1776 



1824 



1872 



1920 



I } CTT GAA TTA TTA GGA GCA GGT ATT TTA TTA GAA TTT GAA CCC GAG CT^ 

Leu Glu Leu Leu Gly Ala Gly He Leu Leu Glu Phe Glu Pro Glu Leu 
tb 630 635 640 

TTA ATT CCT ACA ATT TTA GTA TTC ACG ATA AAA TCT TTT TTA GGT TCA I9 6fl 
-0 Leu He Pro Thr He Leu Val Phe Thr lie Lys Ser Phe Leu Gly Ser 

645 • 650 655 



2016 



2064 



:ii2 



2160 



TCT AAG TAT AAT AGT TAT ACT TTA GAG GAA AAA AAT GAG CTT ACA AAT -2 08 

W Ser Lys Tyr Asn Ser Tyr Thr Leu Glu Glu Lys Asn Glu Leu Thr Asn 

7 25 730 735 

AAA TAT GAT ATT AAG CAA ATA GAA AAT GAA CTT AAT CAA AAG GTT TCT 
Lys Tyr Asp He Lys Gin He Glu Asn Glu Leu Asn Gin Lys Val Ser 
4 ' 740 745 750 

ATA GCA ATG AAT AAT ATA GAC AGG TTC TTA ACT GAA AGT TCT ATA TCC 
He Ala Met Asn Asn He Asp Arg Phe Leu Thr Glu Ser Ser He Ser 
755 760 765 

TAT TTA ATG AAA TTA ATA AAT GAA GTA AAA ATT AAT AAA TTA AGA GAA 
Tyr Leu Met Lys Leu He Asn Glu Val Lys lie Asn Lys Leu Arq Glu 
770 775 780 



2256 



2304 



2352 



2400 



AO ^? I A T C TTG GGA GAG AGT CAG C AA GAA CTA AAT TCT ATG GTA ACT 24 4 8 

60 Gly ser lie Leu Gly Glu Ser Gin Gin Glu Leu Asn Ser Met Val Thr 

805 aio ei5 



2496 



2544 
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20 



AGT AGT TCA GTT TTA AAT ATG AGA TAT AAA AAT GAT AAA TAG GTA GAT 
Ser Ser Ser Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tyr Val Asp 
850 855 B60 

ACT TCA GGA TAT GAT TCA AAT ATA AAT ATT AAT GGA GAT GTA TAT AAA 
Thr Ser Gly Tyr Asp Ser Asn He Asn He Asn Gly Asp Val Tyr Lvs 
865 87 0 875 880 



GAA GTT AAT ATA TCT CAA AAT GAT TAC ATT ATA TAT GAT AAT AAA TAT 
Glu Val Asn He Ser Gin Asn Asp Tyr He He Tyr Asp Asn Lys Tvr 
900 90S 910 

AAA AAT TTT AGT ATT AGT TTT TGG GTA AGA ATT CCT AAC TAT GAT AAT 
Lys Asn Phe Ser He Ser Phe Trp Val Arg He Pro Asn Tyr Asn Asn 
915 920 925 

AAG ATA GTA AAT GTT AAT AAT GAA TAC ACT ATA ATA AAT TGT ATG AGA 
Lys He Val Asn Val Asn Asn Glu Tyr Thr He He Asn Cys Met Arq 
930 935 940 * 

GAT AAT AAT TCA GGA TGG AAA GTA TCT CTT AAT CAT AAT GAA ATA ATT 
Asp Asn Asn Ser Gly Trp Lys Val Ser Leu Asn His Asn Glu He He 
945 950 955 9 6 o 



2592 



2640 



TAT CCA ACT AAT AAA AAT CAA TTT GGA ATA TAT AAT GAT AAA CTT AGT 26 8 6 

10 Tyr Pro Thr Asn Lys Asn Gin Phe Gly He Tyr Asn Asp Lys Leu Ser 

885 890 895 



2736 



2784 



2832 



2880 



30 


TGG 
Trp 


ACA TTG 
Thr Leu 


CAA 
Gin 


GAT 
Asp 
965 


AAT 
Asn 


GCA 
Ala 


GGA 
Gly 


ATT 

lie 


AAT 
Asn 
970 


CAA 
Gin 


AAA 
Lys 


TTA GCA 
Leu Ala 


TTT 
Phe 

975 


AAC 
Asn 


2928 


35 


TAT 
Tyr 


GGT AAC 
Gly Asn 


GCA 
Ala 
980 


AAT 

Asn 


GGT 

Gly 


ATT 
He 


TCT 
Ser 


GAT 
Asp 
985 


TAT 
Tyr 


ATA 
He 


AAT 
Asn 


AAG TGG 
Lys Trp 
990 


ATT 
lie 


TTT 
Phe 


2976 


40 


GTA 
Val 


ACT ATA 
Thr lie 
995 


ACT 
Thr 


AAT 
Asn 


GAT 
Asp 


AGA 
Arg 


TTA GGA GAT 
Leu Gly Asp 
1000 


TCT 
Ser 


AAA 
Lys 


CTT TAT 
Leu Tyr 
1005 


ATT 
lie 


AAT 
Asn 


3024 




GGA 
Gly 


AAT TTA 
Asn Leu 
1010 


ATA 
He 


GAT 
Asp 


CAA 
Gin 


AAA TCA 
Lys Ser 
1015 


ATT 
He 


TTA 
Leu 


AAT 
Asn 


TTA GGT AAT 
Leu Gly Asn 
1020 


ATT 
He 


CAT 
His 


3072 


45 


GTT AGT GAC 
Val Ser Asp 
1025 


AAT 
Asn 


ATA 
He 


TTA TTT 
Leu Phe 
1030 


AAA 

Lys 


ATA 
He 


GTT 
Val 


AAT TGT 
Asn Cys 
1035 


AGT TAT 
Ser Tyr 


ACA 
Thr 


AGA 
Arg 
1040 


3120 


50 


TAT 
Tyr 


ATT GGT 
He Gly 


ATT 
He 


AGA TAT 
Arg Tyr 
1045 


TTT 
Phe 


AAT 
Asn 


ATT 
He 


TTT GAT 
Phe Asp 
1050 


AAA 
Lys 


GAA TTA 
Glu Leu 


GAT GAA 
Asp Glu 
1055 


3168 


55 


ACA 
Thr 


GAA ATT 
Glu lie 


CAA ACT 
Gin Thr 
1060 


TTA 

Leu 


TAT 
Tyr 


AGC 
Ser 


AAT GAA 
Asn Glu 
1065 


CCT 
Pro 


AAT 
Asn 


ACA AAT ATT 
Thr Asn lie 
1070 


TTG 
Leu 


3216 


60 


AAG 
Lys 


GAT TTT TGG 
Asp Phe Trp 
1075 


GGA 
Gly 


AAT 
Asn 


TAT 
Tyr 


TTG 
Leu 
108C 


CTT 
Leu 

I 


TAT 
Tyr 


GAC 
Asp 


AAA 
Lys 


GAA TAC 
Glu Tyr 
1085 


TAT 
Tyr 


TTA 
Leu 


3264 




TTA 
Leu 


AAT GTG 
Asn Val 
1090 


TTA 

Leu 


AAA 

Lys 


CCA 
Pro 


AAT 
Asn 
1095 


AAC 
Asn 


TTT 
Phe 


ATT 

lie 


GAT 
Asp 


AGG 
Arg 
HOC 


AGA AAA 
Arg Lys 

I 


GAT 
Asp 


TCT 
Ser 


3312 


65 


ACT 
Thr 
1105 


TTA AGC 
Leu Ser 


ATT 
He 


AAT 
Asn 


AAT 
Asn 
1110 


ATA 
He 


AGA 
Arg 


AGC 
Ser 


ACT 
Thr 


ATT 
He 
1115 


CTT 
Leu 


TTA GCT 
Leu Ala 


AAT 
Asn 


AGA 
Arg 
1120 


3360 


70 


TTA 
Leu 


TAT AGT 
Tyr Ser 


GGA 
Gly 


ATA 
He 


AAA 

Lys 


GTT 

Val 


AAA 
Lys 


ATA 
He 


CAA 
Gin 


AGA 
Arg 


GTT 
Val 


AAT AAT 
Asn Asn 


AGT 
Ser 


AGT 

Ser 


3408 
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1125 1135 

Thr ^ T ^ T CTT GTT AGA AAG *** GAT CAG GTA TAT ATT AAT TTT 

Thr Asn Asp sn Leu Val Arg Lys Asn Asp Gin Val Tyr ill J£ Phe 3456 
AAq ° 1145 1150 

ki s s is s s: ss k ^ s s s *- 

S E £ SK - S SS £ S S JSJS « S »» 
IS Si St S S K - E S SJ J2 S V S S? IS •«•• 

1190 1195 1200 

Phi ?* T AAT GGA AAT WT ATT Q GG TTG TTA GGT TTC AAG GCA 

Phe Lys Asn Asn Asn Gly Asn Asn He Gly Leu Leu Gly III Kg J 3648 

1205 1210 12 15 

tsl ?hr vll Val A?I f T ?AT TAT ACA CAT ATG AGA GAT CAT 

Asp Thr Val Val Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp His 
1220 1225 123Q 

A h A AAC c GC AAT GGA TGT TTT TGG TTT ATT TCT GAA GAA CAT GGA 

in, Asn S« 5 A Sn Gly Cys Phe Trp^Asn Phe lie Ser Glu His G^y 

TGG CAA GAA AAA TAA 

Trp Gin Glu Lys 3759 
1250 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1252 amino acids 

(B) TYPE: amino acid 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Met Pro Lys lie Asn Ser Phe Asn Tyr Asn Asp Pro Val Asn Asp Arg 

5 10 15 

Thr lie Leu Tyr He Lys Pro Gly Gly Cys Gin Glu Phe Tyr Lys Ser 



30 



Phe Asn lie Met Lys Asn lie Trp lie lie Pro Glu Arg Asn Val lie 
Jb 40 45 

Gly Thr Thr Pro Gin Asp Phe His Pro Pro Thr Ser Leu Lys Asn Gly 

Asp ser Ser Tyr Tyr Asp Pro Asn Tyr Leu Gin Ser Asp Glu Glu Lys 

70 7 5 80 

Asp Arg Phe Leu Lys lie Val Thr Lys lie Phe Asn Arg He Asn Asn 

85 9° 95 

Asn Leu Ser Gly Gly lie Leu Leu Glu Glu Leu Ser Lys Ala Asn Pro 
100 105 no 

Tyr Leu Gly Asn Asp Asn Thr Pro Asp Asn Gin Phe His He Gly Aso 
US 120 125 Y P 

Ala ser Ala Val Glu He Lys Phe Ser Asn Gly Ser Gin Asp He Leu 
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3696 



3744 
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10 



15 



20 



30 



40 



45 



50 



60 



65 



70 



130 

Leu Pro Asn 
145 

Asn Ser Ser 

Gly Phe Gly 

Arg Phe Asn 
195 

Thr Leu Met 
210 

Lys Gly He 
22S 

He Thr Asn 

Gly Thr Asp 

Thr Asn Leu 
275 

Val Gin Val 
290 

Ala Lys Tyr 
305 

He Asn Lys 
Phe Asp Leu 



135 



140 



Val He 

Asn He 
165 

Ser He 
180 

Asp Asn 

His Glu 

Thr Thr 

lie Arg 
245 

Leu Asn 
260 

Leu Ala 
Ser Asn 
Gly Leu 



Phe Asn 
325 

Ala Thr 
340 



Gly Gin Tyr 
355 

Tyr Asn He 
370 

Arg Gly Gin 
3B5 

Gly Arg Gly 

Ser Val Lys 

Glu Leu Phe 
435 

Asn Thr Pro 

450 

Glu Asn Asp 
465 

Pro Gly Leu 
Tyr He Pro 



Lys Tyr 

Ser Glu 

Asn Ala 

Leu Val 
405 

Gly He 
420 

Phe Val 
Lys Glu 
Leu Asp 



Ser Asp 
485 

Lys Tyr 
500 



He Met Gly Ala Glu Pro Asp Leu Phe Glu Thr 
150 155 i6o 

Ser Leu Arg Asn Asn Tyr Met Pro Ser Asn His 
170 175 

Ala He Val Thr Phe Ser Pro Glu Tyr Ser Phe 
185 190 

Ser Met Asn Glu Phe He Gin Asp Pro Ala Leu 
200 205 

Leu He His Ser Leu His Gly Leu Tyr Gly Ala 
215 220 

Lys Tyr Thr He Thr Gin Lys Gin Asn Pro Leu 
230 235 2 40 

Gly Thr Asn lie Glu Glu Phe Leu Thr Phe Gly 
250 255 

He He Thr Ser Ala Gin Ser Asn Asp He Tyr 
265 270 

Asp Tyr Lys Lys He Ala Ser Lvs Leu Ser Lys 
280 2 g 5 

Pro Leu Leu Asn Pro Tyr Lys Asp Val Phe Glu 
295 3 0 o 

Asp Lys Asp Ala Ser Gly He Tyr Ser Val Asn 
310 315 320 

Asp He Phe Lys Lys Leu Tvr Ser Phe Thr Glu 
330 ' 335 

Lys Phe Gin Val Lys Cys Arg Gin Thr Tyr He 
345 35 0 

Phe Lys Leu Ser Asn Leu Leu Asn Asp Ser He 
360 365 

Gly Tyr Asn He Asn Asn Leu Lys Val Asn Phe 
375 380 

Asn Leu Asn Pro Arg He He Thr Pro He Thr 
390 395 400 

Lys Lys lie He Arg Phe Cys Lys Asn lie Val 
410 415 

Arg Lys Ser He Cys He Glu He Asn Asn Gly 
425 430 

Ala Ser Glu Asn Ser Tyr Asn Asp Asp Asn He 
440 ' 445 

He Asp Asp Thr Val Thr Ser Asn Asn Asn Tyr 
45 5 460 

Gin Val He Leu Asn Phe Asn Ser Glu Ser Ala 
470 475 48o 

Glu Lys Leu Asn Leu Thr He Gin Asn Asp Ala 
490 495 

Asp Ser Asn Gly Thr Ser Asp He Glu Gin His 
505 5i 0 



321 
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Asp val Asn Glu Leu Asn Val Phe Phe Tyr Leu Asp Ala Gin Lys Val 

520 525 
Pro Glu Gly Glu Asn Asn Val Asn Leu Thr Ser Ser j le Asp Tnr Ala 

bJ5 540 

545 LeU Glu Gln Pro Ile Thr Phe Phe Ser Ser Glu Phe lie 

550 555 560 

Asn Asn Val Asn Lys Pro Val Gin Ala Ala Leu Phe Val Ser Trp He 

570 575 

Gin Gin Val Leu Val Asp Phe Thr Thr Glu Ala Asn Gin Lys Ser Thr 

585 590 

Val Asp Lys lie Ala Asp He ser lie Val Val Pro Tyr He Gly Leu 



605 

Ala Leu Asn lie Gly Asn Glu Ala Gin Lys Gly Asn Phe Lys Asp Ala 

615 620 
Leu Glu Leu Leu Gly Ala Gly U e Leu Leu Glu Phe Glu Pro Glu Leu 

635 640 
Leu lie Pro Thr lie Leu Val Phe Thr lie Lys Ser Phe Leu Gly Ser 

645 650 65 £ 

Ser Asp Asn Lys Asn Lys Val He Lys Ala lie Asn Asn Ala Leu Lvs 

665 670 



Glu Arg Asp Glu Lys Trp Lys Glu Val Tyr Ser Phe lie Val Ser Asn 

680 685 

Trp Met Thr Lys lie Asn Thr Gin Phe Asn Lys Arg Lys Glu Gin Met 

695 700 
Tyr Gin Ala Leu Gin Asn Gin Val Asn Ala lie Lys Thr lie lie Glu 

710 7 *5 7 20 

Ser Lys Tyr Asn Ser Tyr Thr Leu Glu Glu Lys Asn Glu Leu Thr Asn 



725 '30 735 



Lys Tyr Asp lie Lys Gin lie Glu Asn Glu Leu Asn Gin Lys Val Ser 
He Ala Met Asn Asn lie Asp Arg Phe Leu Thr Glu Ser Ser lie Ser 



765 



Tyr Leu Met Lys Leu lie Asn Glu Val Lys lie Asn Lys Leu Arg Glu 

775 780 

Tyr Asp Glu Asn Val Lys Thr Tyr Leu Leu Asn Tyr lie lie Gin His 

790 7 " 800 

Gly Ser lie Leu Gly Glu Ser Gin Gin Glu Leu Asn Ser Met Val Thr 

805 810 815 

Asp Thr Leu Asn Asn Ser lie Pro Phe Lys Leu Ser Ser Tyr Thr Asp 

825 830 
Asp Lys lie Leu lie Ser Tyr Phe Asn Lys Phe Phe Lys Arg lie Lys 

Ser ser Ser Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tyr Val Asp 



860 



Thr Ser Gly Tyr Asp Ser Asn lie Asn lie Asn Gly Asp Val Tyr Lys 

875 880 
Tyr Pro Thr Asn Lys Asn Gin Phe Gly lie Tyr Asn Asp Lys Leu Ser 
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885 890 



895 



Glu val Asn lie Ser Gin Asn Asp Tyr lie lie Tyr Asp Asn Lys Tyr 
900 90S 910 

Lys Asn Phe Ser He Ser Phe Trp Val Arg lie Pro Asn Tyr Asp Asn 
3 920 925 

Lys lie Val Asn Val Asn Asn Glu Tyr Thr lie lie Asn Cys Met Arg 



940 



Asp Asn Asn Ser Gly Trp Lys Val Ser Leu Asn His Asn Glu He n e 

950 955 960 

Trp Thr Leu Gin Asp Asn Ala Gly He Asn Gin Lys Leu Ala Phe Asn 
965 970 g75 

Tyr Gly Asn Ala Asn Gly U e Ser Asp Tyr lie Asn Lys Trp He Phe 

985 9 90 

Val Thr Ue Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr lie Asn 

1000 1005 

Gly Asn Leu lie Asp Gin Lys Ser He Leu Asn Leu Gly Asn lie His 

1015 !020 

loas 5 " ASP 116 L L t U 30 Phe LyS Ile Val J-^X- Ser Tyr Thr Arg^ 

Tyr lie Gly He Arg Tyr Phe Asn lie Phe Asp Lys Glu Leu Asp Giu 

1045 1050 10 | 5 

Thr Glu lie Gin Thr Leu Tyr Ser Asn Glu Pro Asn Thr Asn lie Leu 

1060 1065 1070 

Lys Asp Ph^Trp Gly Asn Tyr Leu^eu Tyr Asp Lys Glu Tyr Tyr Leu 



1085 



Leu Asn val Leu Lys Pro Asn Asn Phe lie Asp Arg Arg Lys Asp ser 

1095 1100 

Thr Leu Ser I le Asn Asn lie Arg Ser Thr lie Leu Leu Ala Asn Arg 

1110 1H5 1120 

Leu Tyr Ser Gly lie Lys Val Lys He Gin Arg Val Asn Asn Ser Ser 
1125 H30 i 135 

Thr Asn Asp Asn Leu Val Arg Lys Asn Asp Gin Val Tyr He Asn Phe 
H40 1145 115Q 

Val Ala ser Lys Thr His Leu Phe Pro Leu Tyr Ala Asp Thr Ala Thr 

J-J-" 1160 



1165 



Thr Asn Lys Glu Lys Thr He Lys lie Ser Ser Ser Gly Asn Arg Phe 

1175 1180 

Asn Gin Val Val Val Met Asn Ser Val Gly Asn Asn Cys Thr Met Asn 

1190 H95 1200 

Phe Lys Asn Asn Asn Gly Asn Asn He Gly Leu Leu Gly Phe Lyc Ala 

1205 1210 1215 

Asp Thr val Va^Ala Ser Thr Trp Ty^Tyr Thr His Met Arg Asp His 



1230 



Thr Asn Ser Asn Gly C ys Phe Trp Asn Phe lie Ser Glu Glu His Gly 
Jb 1240 1245 



Trp Gin Glu Lys 
1250 
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(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1463 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = " DNA " 

Ux) FEATURE: 

(A) NAME/KEY: CDS 

<B) LOCATION: 108.. 1460 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 

TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT 

Met Gly His 
1 

CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT 
Hxs His His His His His His His His Ser Ser Gly His He Glu Gly 
5 10 15 



60 
116 

164 

212 



308 



CGT CAT ATG GCT AGC ATG GCT CTT TCT TCT TAT ACA GAT GAT AAA ATT 
Arg His Met Ala Ser Met Ala Leu Ser Ser Tyr Thr Asp Asp Lys He 
"° 25 30 35 

TTA ATT TCA TAT TTT AAT AAG TTC TTT AAG AGA ATT AAA AGT AGT TCT 260 
Leu lie Ser Tyr Phe Asn Lys Phe Phe Lys Arg He Lys Ser Ser Ser 
40 45 50 

GTT TTA AAT ATG AGA TAT AAA AAT GAT AAA TAC GTA GAT ACT TCA GGA 
Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tyr Val Asp Thr Ser Gly 
55 60 6 5 

TAT GAT TCA AAT ATA AAT ATT AAT GGA GAT GTA TAT AAA TAT CCA ACT 3 56 

Tyr Asp Ser Asn He Asn He Asn Gly Asp Val Tyr Lys Tyr Pro Thr 
70 75 so 

AAT AAA AAT CAA TTT GGA ATA TAT AAT GAT AAA CTT AGT GAA GTT AAT 4 04 

Asn Lys Asn Gin Phe Gly He Tyr Asn Asp Lys Leu Ser Glu Va] Asn 
g 5 90 95 

ATA TCT CAA AAT GAT TAC ATT ATA TAT GAT AAT AAA TAT AAA AAT TTT 4 52 

SO Gln ASn ASP Tyr Ue Ue T Y r As P Asn L V5 Tyr Lys Asn Phe 

J0 ° 105 li0 115 

AGT ATT AGT TTT TGG GTA AGA ATT CCT AAC TAT GAT AAT AAG ATA GTA 500 
Ser He Ser Phe Trp Val Arg He Pro Asn Tyr Asp Asn Lys He Val 
55 120 125 130 

AAT GTT AAT AAT GAA TAC ACT ATA ATA AAT TGT ATG AGG GAT AAT AAT 54 8 

Asn Val Asn Asn Glu Tyr Thr He lie Asn Cys Met Arg Asp Asn Asn 
135 140 145 

60 TCA GGA TGG AAA GTA TCT CTT AAT CAT AAT GAA ATA ATT TGG ACA TTG 5 96 

Ser Gly Trp Lys Val Ser Leu Asn His Asn Glu He lie Trp Thr Leu 
150 155 160 

CAA GAT AAT TCA GGA ATT AAT CAA AAA TTA GCA TTT AAC TAT GGT AAC 644 
Gin Asp Asn Ser Gly lie Asn Gin Lys Leu Ala Phe Asn Tyr Gly Asn 
165 170 175 

GCA AAT GGT ATT TCT GAT TAT ATA AAT AAG TGG ATT TTT GTA ACT ATA 6 92 

70 AGn ° ly 116 Ser Asp Tyr lle Asn L Y S T ^P He Phe Val Thr lie 

n 180 185 i9o 195 



- 324 - 
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50 



ACT AAT GAT AGA TTA GGA GAT TCT AAA CTT TAT ATT AAT GGA AAT TTA 
Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr lie Asn Gly Asn Leu 
200 205 210 

ATA GAT AAA AAA TCA ATT TTA AAT TTA GGT AAT ATT CAT GTT AGT GAC 
lie Asp Lys Lys Ser lie Leu Asn Leu Gly Asn He His Val Ser Asd 
215 220 2 25 



ATT AGA TAT TTT AAT ATT TTT GAT AAA GAA TTA GAT GAA ACA GAA ATT 
lie Arg Tyr Phe Asn He Phe Asp Lys Glu Leu Asp Glu Thr Glu He 
245 250 255 



CAA ACT TTA TAT AAC AAT GAA CCT AAT GCA AAT ATT TTA AAG GAT TTT 
Gin Thr Leu Tyr Asn Asn Glu Pro Asn Ala Asn lie Leu Lys Asp Phe 
20 265 270 275 

TGG GGA AAT TAT TTG CTT TAT GAC AAA GAA TAC TAT TTA TTA AAT GTG 
Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tyr Leu Leu Asn Val 
280 265 290 



GGA TTT TTT TGG AAC TTT ATT TCT GAA GAA CAT GGA TGG CAA GAA AAA 
Gly Phe Phe Trp Asn Phe He Ser Glu Glu Hxs Gly Trp Gin Glu Lys 
440 445 450 

65 TAA 

(2) INFORMATION FOR SEQ ID NO: 54: 



70 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 51 amino acids 



740 



7B8 



AAT ATA TTA TTT AAA ATA GTT AAT TGT AGT TAT ACA AGA TAT ATT GGT 8 36 

IU Asn He Leu Phe Lys He Val Asn Cys Ser Tyr Thr Arg Tyr He Gly 
230 235 240 



884 



932 



980 



1028 



TTA AAA CCA AAT AAC TTT ATT AAT AGG AGA ACA GAT TCT ACT TTA AGC 
Leu Lys Pro Asn Asn Phe He Asn Arg Arg Thr Asp Ser Thr Leu Ser 
295 300 3 05 

iTl T T rT A AG ° ACT ATT CTT TTA GCT AGA TTA TAT AGT 1076 

He Asn Asn He Arg Ser Thr He Leu Leu Ala Asn Arg Leu Tyr Ser 

310 315 320 

GGA ATA AAA GTT AAA ATA CAA AGA GTT AAT AAT AGT AGT ACT AAC GAT 1124 
Gly He Lys Val Lys lie Gin Arg Val Asn Asn Ser Ser Thr Asn Asp 
JJ 3.5 330 335 

AAT CTT GTT AGA AAG AAT GAT CAG GTA TAT ATT AAT TTT GTA GCC AGC 1172 
Asn Leu Val Arg Lys Asn Asp Gin Val Tyr He Asn Phe Val Ala Ser 
40 345 350 355 

AAA ACT CAC TTA CTT CCA TTA TAT GCT GAT ACA GCT ACC ACA AAT AAA 1220 
Lys Thr His Leu Leu Pro Leu Tyr Ala Asp Thr Ala Thr Thr Asn Lys 
360 365 370 

45 GAG AAA ACA ATA AAA ATA TCA TCA TCT GGC AAT AGA TTT AAT CAA GTA 

Glu Lys Thr lie Lys He Ser Ser Ser Gly Asn Arg Phe Asn Gin Val 
375 380 385 

GTA GTT ATG AAT TCA GTA GGA AAT TGT ACA ATG AAT TTT AAA AAT AAT 1316 
✓al Val Met Asn Ser Val Gly Asn Cys Thr Met Asn Phe Lys Asn Asn 
J9 ° 395 400 

AAT GGA AAT AAT ATT GGG TTG TTA GGT TTC AAG GCA GAT ACT GTA GTT 1364 

Asn Gly Asn Asn lie Gly Leu Leu Gly Phe Lys Ala Asp Thr Val Val 
405 410 415 

c GT ^ T I GG TAT TAT ACA CAT ATG AGA GAT AAT ACA AAC AGC AAT 1412 

Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp Asn Thr Asn Ser Asn 

60 425 430 435 



1268 



1460 



1463 



- 325 - 
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<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 



Met Gly His His His His His His His His His His Ser ser Gly His 

5 10 15 

He Glu Gly Arg His Met Ala Ser Met Ala Leu Ser Ser Tyr Thr Asp 

25 30 
Asp Ly S Ue Leu He Ser Tyr Phe Asn Lys Phe Phe Lys Arg He Lys 

Ser Ser Ser Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tyr Val Asp 



15 



55 60 



Thr Ser Gly Tyr Asp Ser Asn lie Asn lie Asn Gly Asp Val Tyr Lys 

75 80 

Tyr Pro Thr Asn Lys Asn Gin Phe Gly He Tyr Asn Asp Lys Leu Ser 
85 90 95 

Glu Val Asn lie Ser Gin Asn Asp Tyr He He Tvr Asp Asn Lvs Tvr 
100 105 



110 



Lvs Asn Phe Ser He Ser Phe Trp Val Arg lie Pro Asn Tyr Asp Asn 
115 120 125 

Lys lie Val Asn Val Asn Asn Glu Tyr Thr He lie Asn Cvs Met Are 

135 140 

3-"» Asp Asn Asn Ser Gly Trp Lys Val Ser Leu Asn His Asn Glu lie He 

Trp Thr Leu Gin Asp Asn Ser Gly lie Asn Gin Lys Leu Ala Phe Asn 
40 165 170 175 

Tyr Gly Asn Ala Asn Gly lie Ser Asp Tyr lie Asn Lys Trp lie Phe 
180 185 19 q 

4 . Val Thr lie Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr lie Asn 

lyb 2 °0 205 



Gly Asn Leu He Asp Lys Lys Ser He Leu Asn Leu Gly Asn He His 



220 



Val Ser Asp Asn lie Leu Phe Lys He Val Asn Cys Ser Tyr Thr Arg 

230 235 240 

Tyr He Gly He Arg Tyr Phe Asn He Phe Asp Lys Glu Leu Asp Glu 
55 245 250 255 

Thr Glu He Gin Thr Leu Tyr Asn Asn Glu Pro Asn Ala Asn He Leu 
260 265 270 

Lys Asp Phe Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Gl u Tyr Tyr Leu 



285 



Leu Asn val Leu Lys Pro Asn Asn Phe He Asn Arg Arg Thr Asp Ser 

Thr Leu Ser He Asn Asn He Arg Ser Thr lie Leu Leu Ala Asn Arg 

5 310 315 320 

Leu Tyr Ser Gly He Lys Val Lys He Gin Arg Val Asn Asn Ser Ser 

70 325 330 335 



- .126 - 
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Thr Asn Asp Asn Leu 
340 

Val Ala Ser Lys Thr 
355 

Thr Asn Lys Glu Lys 
370 

Asn Gin Val Val Val 
385 

Lys Asn Asn Asn Gly 
405 

Thr Val Val Ala Ser 
420 

Asn Ser Asn Gly Phe 
435 

Gin Glu Lys 
450 



Val Arg Lys Asn Asp Gin 
345 

His Leu Leu Pro Leu Tyr 
360 

Thr lie Lys lie Ser Ser 
375 



Met Asn Ser Val Gly Asn 
390 395 

Asn Asn lie Gly Leu Leu 
410 

Thr Trp Tyr Tyr Thr His 
425 

Phe Trp Asn Phe lie Ser 

440 



Val Tyr lie Asn Phe 
350 

Ala Asp Thr Ala Thr 
365 

Ser Gly Asn Arg Phe 
380 

Cys Thr Met Asn Phe 
400 

Gly Phe Lys Ala Asp 
415 

Met Arg Asp Asn Thr 
430 

Glu Glu His Gly Trp 
445 



\?.) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1472 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE : other nucleic acid 
(A) DESCRIPTION: /desc - "DNA n 

(ix> FEATURE : 

(A) NAME /KEY: CDS 

(B) LOCATION: 10B..1463 

(X-) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 

TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT 

Met Gly His 
1 

Hil His ml CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT 

His His His His His His His His His Ser Ser Gly His He Glu Glv 

5 10 15 ' 

Ara Hil mI G ?T AGC ATG GCT CTT TCT TCT TAT ACA GAT GAT AAA ATT 
Arg His Met Ala Ser Met Ala Leu Ser Ser Tyr Thr Asp Asp Lys lie 

25 30 35 

TTA ATT TCA TAT TTT AAT AAA TTC TTT AAG AGA ATT AAA ACT AGT TCA 
Leu lie Ser Tyr Phe Asn Lys Phe Phe Lys Arg He Lvs Ser Ser Ser 
40 45 ' so 

GTT TTA AAT ATG AGA TAT AAA AAT GAT AAA TAC GTA GAT ACT TCA GGA 
Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tyr Val Asp Thr Ser G^v 
55 60 G5 

Itl ttl I CA ?* T rT A ATT ^ GGA GAT GTA TAT ^ TAT CCA ACT 

Tyr Asp ser Asn lie Asn lie Asn uly Asp Val Tyr Lys Tyr Pro Thr 

70 7 5 80 

AAT AAA AAT CAA TTT GGA ATA TAT AAT GAT AAA CTT AGT GAA GTT AAT 

Asn Lys Asn Gin Phe Glv He Tyr Asn Asp Lys Lou Sor Glu Val Asn 
85 9° 95 



60 
116 



164 



212 



260 



306 



356 



4 04 
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' S S Si S £ £ JS 35 £ £ i£ E ?" £ SI S «" 
E SI E S SJ S3 « K 5" K ™ « - « - ™ 

120 "5 130 

2 SII SI JS SK £ S J2 ;u iS 51 51? £ 2* 2 "I 

s sj S K Sf 22 21 XS SS i£ ss S SI 5 s US »- 



3i 2S S °S SS 51 i£ S s SI S £ £ 

20 170 175 



185 19° 195 

^ ThI ^ T « AT AGA TTA GGA GAT TCT AAA CTT TAT ATT AAT GGA AAT TTA 

Thx Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr Me Asn Gly ^ lit 

205 210 

»() tT* GAT *** TCA ATT TTA *»* TTA GGT AAT ATT CAT GTT AGT GAC 

He Asp Gin Lys Ser He Leu Asn Leu Gly Asn lie His vll ser Asp 

220 - - 



225 



AAT ATA TTA TTT AAA ATA GTT AAT TGT AGT TAT ACA AGA TAT ATT rr-r 
Asn lie Leu Phe Lys lie Val Asn Cys Ser Tyr Thr Arg ™ ill SJ 

235 240 

tTI AGA T Z T AAT ATT TTT GAT AAA GAA TTA GAT GAA ACA GAA ATT 

He Arg T yr Phe Asn He Phe Asp Lys Glu Leu Asp cK Thr Glu lie 
4() 250 255 

CAA ACT TTA TAT AGC AAT GAA CCT AAT ACA AAT ATT TTG AAG GAT TTT 
Gin Thr Leu Tyr Ser Asn Glu Pro Asn Thr Asn III Hu t£ SI 

270 27S 

TGG GGA AAT TAT TTG CTT TAT GAC AAA GAA TAC TAT TTA TTA AAT GTG 
HP G.y Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tyr Leu SJ vll 



285 



290 



Uu Lys Pro Asn Asn Phe He Asp Arg" £g Lys Asp Ser ™ Ser 

295 300 

tTI A T A AGA AGC ACT ATT CTT TTA G CT AAT AGA TTA TAT AGT 

55 He Asn Asn He Arg Ser Thr He Leu Leu Ala Asn Arg Leu Tyr Ser 

3X5 320 

GGA ATA AAA GTT AAA ATA CAA AGA GTT AAT AAT AGT AGT ACT AAC GAT 
Gly lie Lys Val Lys lie Gin Arg Val Asn Asn Ser Ser Thr A^n tsl 
60 5 330 335 

A^n ?r ST AGA ^ AAT GAT CAG GTA TAT ATT AAT TTT GTA GCC AGC 
Asn Leu val Arg Lys Asn Asp Gin Val Tyr He Asn Phe vll Ala Ser 

345 350 355 

Lvs Thr S£ ™ HI C ™ ™ ™ GCT GAT A F A ?« A CC ACA AAT AAA 

Asn 
370 

Gl« Lys Thr He L^ys Vie Se7 Ser C^y ^ J£ 

- 328 



t rvZ „ ^ H liA 1AT GCT GAT ACA GCT ACC ACA AAT AAi 

Lys Thr His Leu Phe Pro Leu Tyr Ala Asp Thr Ala Thr Thr ^ 
360 - 



500 



548 



644 



s ss % tn e si s e ^ ~ is s s s s: •» 



740 



788 



836 



884 



93: 



980 



50 Leu L^ III 21 HI tT ™ AGG AGA AAA GAT TCT ACT TTA AOC 



1076 



1124 



1172 



1220 



70 Glu L^ Thr "I" ^ lT A If" I CA If! G ? C AGA T? T AAT CAA GTA 
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375 380 

-II Val T I" G ^ GGA ^ TGT AC * ATG AAT TTT AAA AAT 

•al Val Met Asn Ser Val Gly Asn Asn Cys Thr Met Asn Phe JJJ 1316 



400 



E si ss ^ T JS S S E K SJ £ £ s s s S 



415 



21 S £ S S S S £ £ SE £ £ SS S « - - 



430 435 



"n T SJ SI 3 £ IS SI 31 E Si 21 SI £ Si Si »«• 



445 4S0 



AAA TAAAAGCTT 

1472 

f2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS ■ 

(A) LENGTH: 4 52 amino acids 
<B> TYPE: amino acid 
(D) TOPOLOGY: linear 

iii> MOLECULE TYPE: protein 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
Met Gly His His His Has His His His His His His Ser Ser Gly His 



5 



10 



He Glu Gly Arg His Met Ala Ser Met Ala Leu Ser Ser Tyr Thr Asp 

25 30 

Asp Lys lie Leu He Ser Tyr Phe Asn Lys Phe Phe Lys Arg lie L ys 

JD 40 45 7 

Ser ser Ser Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tyr Val Asp 

it3 60 
Thr ser Gly Tyr Asp Ser Asn He Asn lie Asn Gly Asp Val Ty: Lys 

75 80 
Tyr Pro Thr Asn Lys Asn Gin Phe Gly lie Tyr Asn Asp Lys Leu Ser 

85 9 ° 95 ' 

Glu Val Asn lie Ser Gin Asn Asp Tyr lie He Tyr Asp Asn Lys Tyr 

105 110 
Lys Asn Phe Ser lie Se r Phe Trp Val Arg lie Pro Asn Tyr Asp Asn 

120 125 

Lys lie val Asn Val Asn Asn Glu Tyr Thr He He Asn Cys Met Arg 

13j 140 
A.p Asn Asn Ser Gly Trp Lys Val Ser Leu Asn His Asn Glu He He 

150 155 160 

Trp Thr Leu Gin Asp Asn Ala Gly He Asn Gin Lys Leu Ala Phe Asn 
5 170 175 

Tyr Gly Asn Ala Asn Gly He Ser Asp Tyr He Asn Lys Trp He P he 

185 19Q 

Val Thr lie Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr He Asn 
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Gly Asn Leu lie Asp Gin Lys Ser He Leu Asn Leu Gly Asn He His 

21 5 220 

. Val Ser Asp Asn He Leu Phe Lys lie val Asn Cys Ser Tyr Thr Arq 

" 5 230 235 240 

Tyr lie Gly He Arg Tyr Phe Asn He Phe Asp Lys Glu Leu Asp Glu 
245 2 5 0 25 j: 

10 Thr Glu He Gin Thr Leu Tyr Ser Asn Glu Pro Asn Thr Asn He Leu 
260 265 270 

Lys Asp Phe Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tyr Leu 
|5 275 280 285 

Leu Asn Val Leu Lys Pro Asn Asn Phe He Asp Arg Arg Lys Asp Ser 
^ yU 295 300 

Thr Leu Ser He Asn Asn He Arg Ser Thr He Leu Leu Ala Asn Arg 
5 310 315 3 20 

Leu Tyr Ser Gly Ik Lys Val Lys He Gin Arg Val Asn Asn Ser Ser 
3 -5 330 335 

2* Thr Asn Asp Asn Leu Val Arg Lys Asn Asp Gin Val Tyr He Asn Phe 

340 345 350 

Val Ala Ser Lys Thr His Leu Phe Pro Leu Tyr Ala Asp Thr Ala Thr 
355 360 365 

Thr Asn Lys Glu Lys Thr He Lys lie Ser Ser Ser Gly Asn Arg Phe 
J U 375 3 8 o 

Asn Gin Val Val Val Met Asn Ser Val Gly Asn Asn Cys Thr Met Asn 
385 390 395 400 

Phe Lys Asn Asn Asn Gly Asn Asn He Gly Leu Leu Gly Phe Lys Ala 
405 410 415 

Asp Thr Val Val Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp His 
420 425 43 0 

Thr Asn Ser Asn Gly Cys Phe Trp Asn Phe He Ser Glu Glu His Glv 
43 5 4 4 o 445 

Trp Gin Glu Lys 
450 



30 



40 



45 



(2) INFORMATION FOP SEQ ID NO: 57: 

U) SEQUENCE CHARACTERISTICS: 
I A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA 1 ' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

CGCCATGGCT CTTTCTTCTT ATACAGATGA T 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 2 9 base pairs 
<B> TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 



50 



55 



- 330 - 
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48 



96 



(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = " DNA " 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

GCAAGCTTTT ATTTTTCTTG CCATCCATG 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3876 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
( D) TOPOLOGY: linear 

(li) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..3873 

<:<i> SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

ATG CCA ATA ACA ATT AAC AAC TTT AAT TAT TCA GAT CCT GTT GAT AAT 
Met Pro He Thr lie Asn Asn Phe Asn Tyr Ser Asp Pro Val Asp Asn 

AAA AAT ATT TTA TAT TTA GAT ACT CAT TTA AAT ACA CTA GCT AAT GAG 
Lys Asn He Leu Tyr Leu Asp Thr His Leu Asn Thr Leu Ala Asn Glu 
20 25 30 

CCT GAA AAA GCC TTT CGC ATT ACA GGA AAT ATA TGG GTA ATA CCT GAT 
Pro Glu Lys Ala Phe Arg He Thr Gly Asn He Trp Val He Pro Asp 
^ 5 40 45 ^ 

a™ dh T I CA AGA AAT TCT AAT CCA MT TTA AAT ^ CCT CCT CGA GTT 
Aig Phe Ser Arg Asn Ser Asn Pro Asn Leu Asn Lys Pro Pro Arg Val 
^° 55 60 

ACA AGC CCT AAA AGT GGT TAT TAT GAT CCT AAT TAT TTG AGT ACT GAT 
Thr Ser Pro Lys Ser Gly Tyr Tyr Asp Pro Asn Tyr Leu Ser Thr Asp 
65 70 75 8 5 

TCT GAC AAA GAT ACA TTT TTA AAA GAA ATT ATA AAG TTA TTT AAA AGA 
■*> Ser Asp Lys Asp Thr Phe Leu Lys Glu lie He Lvs Leu Phe Lys Arg 

85 90 y5 

ATT AAT TCT AGA GAA ATA GGA GAA GAA TTA ATA TAT AGA CTT TCG ACA 
He Asn Ser Arg Glu He Gly Glu Glu Leu He Tyr Arg Leu Ser Thr 
100 105 no 

Asn tTo o CC II 7 CCT G ? G ** T ** C ACT CCA ATT AAT ACT TTT GAT 

Asp He Pro Phe Pro Gly Asn Asn Asn Thr Pro lie Asn Thr Phe Asp 
115 120 125 

TTT GAT GTA GAT TTT AAC AGT GTT GAT GTT AAA ACT AGA CAA GGT AAC 
Phe Asp Val Asp Phe Asn Ser Val Asp Val Lys Thr Arg G^ Gly US 
XJU 135 140 

AAC TGG GTT AAA ACT GGT AGC ATA AAT CCT AGT GTT ATA ATA ACT GGA 
Asn Trp Val Lys Thr Gly Ser lie Asn Pro Ser Val He He Thr GlC 

150 155 i60 

65 PkI ^ AAC ATT ATA GAT CCA GAA ACT TCT ACG TTT AAA TTA ACT 5 , fl 

to Pro Arg Glu Asn He He Asp Pro Glu Thr Ser Thr Phe Lys Leu Thr 

165 170 175 



144 



192 



240 



288 



336 



384 



432 



480 
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AAC AAT ACT TTT GCG GCA CAA GAA GGA TTT GGT GCT TTA TCA ATA ATT 
Asn Asn Thr Phe Ala Ala Gin Glu Gly Phe Gly Ala ™ llr He lie 
180 185 190 

TCA ATA TCA CCT AGA TTT ATG CTA ACA TAT ACT AAT GCA ACT AAT GAT 
Sei lie Ser Pro Arg Phe Met Leu Thr Tyr Ser Asn Ala Thr Asn Asp 
195 200 205 



CTA ATT TTA ATG CAT GAA CTT AAT CAT GCA ATG CAT AAT TTA TAT GGA 
Leu lie Leu Met His Glu Leu Asn His Ala Met His aVh Lei Tyr G X y 



230 "5 240 



7lt 17 ?T n CA *** GAT CAA ACA ATT TCA TCT GTA ACT ACT AAT ATT 
He Ala lie Pro Asn Asp Gin Thr He Ser Ser val Thr Ser Asn lie 

20 245 250 255 

TTT TAT TCT CAA TAT AAT GTG AAA TTA GAG TAT GCA GAA ATA TAT GCA 
Phe Tyr Ser Gin Tyr Asn Val Lys Leu Glu Tyr Ala Glu lie ?JJ 
260 265 270 

25 III °? T CCA ACT ATA GAC CTT ft TT CCT AAA ACT GCA AGG AAA TAT 

Phe Gly Gly Pro Thr lie Asp Leu lie Pro Lys Ser Ala Arg Lys Tyl 

nu« » , "7 " **"»* ili Mia GCT AAA AGA CTT 

Phe Glu Glu Lys Ala Leu Asp Tyr Tyr Arg Ser He Ala £yt Arg Leu 



Asn Sel lie ?hl Thr Ala ^ 7" AGC TTT ^ TAT ATA GGG 

Asn ser He Thr Thr Ala Asn Pro Ser Ser Phe Asn Lys Tyr lie Gly 

310 315 326 



GAA TAT AAA CAG AAA CTT ATT AGA AAG TAT AGA TTC GTA GTA GAA TCT 
Glu ryr Lys Gin Lys Leu He Arg Lys Tyr Arg Phe Val Val Glu III 
40 325 330 335 



TCA GGT GAA GTT ACA GTA AAT CGT AAT AAG TTT GTT GAG TTA TAT AAT 

Ser Gly Glu Val Thr Val Asn Arg Asn Lys Phe Val Glu Leu Tyl ftln 

340 345 35 0 

GAA CTT ACA CAA ATA TTT ACA GAA TTT AAC TAC GCT AAA ATA TAT aat 

Glu Leu Thr Gin lie Phe Thr Glu Phe Asn Tyr 17a L^ S Tyl 

355 360 365 

Val Gin Asn Arg Lys He Tyr Leu Sir £n va? Ty"r ThI Pro VaT ?hr 



Ala A^n 7* ^ GTT ™ T GAT ATA CAA ^AT GGA TTT AAT 

55 A af Leu Asp T£ Asn Val Tyr Asp Ile Gln Asn G1 V Phe Asn 

395 400 

lit ?** c GT AAT TTA ^ GTA CTA TTT ATG GGT CAA AAT TTA TCT 

He Pro Lys Ser Asn Leu Asn Val Leu Phe Met Gly Gin Asn Leu Ser 

60 405 410 415 

CGA AAT CCA GCA TTA AGA AAA GTC AAT CCT GAA AAT ATG CTT TAT TTA 

Aig Asn Pro Ala Leu Arg Lys Val Asn Pro Glu Asn Met Leu Tyr Leu 
420 425 



430 



Phi Th^ f** IF l GT CAT *** GCA ATA GAT GGT AGA T ^A TTA TAT AAT 
Phe Thr Lys Phe Cys His Lys Ala lie Asp Gly Arg Ser Leu Tyr Asn 
435 440 



445 



576 



624 



10 Sit r? A AGA TTT TCT TCT GAA TTT TGG ATG GAT CCA ATA 67 -> 

H) Val Gly Glu Gly Arg Phe Ser Lys Ser Glu Phe Cys Met Asp Pro lie 2 

u 215 220 



720 



768 



816 



864 



30 P^ Glu G^ 2£ ™ G ^ ™ ™ AGA I GT ATA GCT AAA AGA CTT 

295 300 



960 



1008 



1056 



1104 



S() G ™ ^ ^ T f** A T A TAT CTT TCA AAT GTA TAT ACT G CG GTT ACG n 5 , 
Val Gin Asn Arg Lys Ile Tyr ' — ■■ - - — ii->2 

370 375 380 



1200 



1248 



1296 



1344 



70 Th^ I™ ^ T I GT AGA GAG CTT TTA GTT AAA AAT ACT GAC TTA CCC 139" 

'» Lys Thr Leu Asp Cys Arg Glu Leu Leu Val Lys Asn Thr Asp Leu Pro 
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450 455 



460 



10 



15 



5i) 



TTT ATA GGT GAT ATT AGT GAT GTT AAA ACT GAT ATA TTT TTA AGA AAA 
Phe He Gly Asp He Ser Asp Vai Lys Thr Asp He Phe Leu Arq Lys 
465 470 4 75 480 

GAT ATT AAT GAA GAA ACT GAA GTT ATA TAC TAT CCG GAC AAT GTT TCA 
Asp lie Asn Glu Glu Thr Glu Val lie Tyr Tyr Pro Asp Asn Val Ser 
485 490 495 

GTA GAT CAA GTT ATT CTC AGT AAG AAT ACC TCA GAA CAT GGA CAA CTA 
Val Asp Gin Val He Leu Ser Lys Asn Thr Ser Glu His Gly Gin Leu 
500 505 sio 

GAT TTA TTA TAC CCT AGT ATT GAC AGT GAG AGT GAA ATA TTA CCA GGG 
Asp Leu Leu Tyr Pro Ser lie Asp Ser Glu Ser Glu He Leu Pro Gly 
515 520 525 



TAT GAA TGG ATG ATG GGA ACG TGG TTA TCC AGG ATT ATT ACT CAA TTT 

M nXl GiU Trp Met Met Gly Thr Tr P Leu Ser Ar <? He He Thr Gin Phe 

°- 710 715 720 

AAT AAT ATA AGT TAT CAA ATG TAT GAT TCT TTA AAT TAT CAG GCA GGT 
Asn Asn He Ser Tyr- Gin Met Tyr Asp Ser Leu Asn Tyr Gin Ala Gly 
7() 725 730 735 



1440 



1488 



1536 



1584 



GAG AAT CAA GTC TTT TAT GAT AAT AGA ACT CAA AAT GTT GAT TAT TTG 16 3' 

-0 Glu Asn Gin Val Phe Tyr Asp Asn Arg Thr Gin Asn Val Asp Tyr Leu 

530 535 54 0 



1680 



1728 



1776 



1824 



AAT TCT TAT TAT TAC CTA GAA TCT CAA AAA CTA AGT GAT AAT GTT GAA 
Asn Ser Tyr Tyr Tyr Leu Glu Ser Gin Lys Leu Ser Asp Asn Val Glu 
- D 545 550 555 560 

GAT TTT ACT TTT ACG AGA TCA ATT GAG GAG GCT TTG GAT AAT AGT GCA 
Asp Phe Thr Phe Thr Arg Ser He Glu Glu Ala Leu Asp Asn Ser Ala 
3(J 565 570 575 

AAA GTA TAT ACT TAC TTT CCT ACA CTA GCT AAT AAA GTA AAT GCG GGT 
uys Val Tyr Thr Tyr Phe Pro Thr Leu Ala Asn Lys Val Asn Ala Glv 
580 585 59Q 

3 3 GTT CAA GGT GGT TTA TTT TTA ATG TGG GCA AAT GAT GTA GTT GAA GAT 

/al Gin Gly Gly Leu Phe Leu Met Trp Ala Asn Asp Val Val Glu Asp 
595 600 605 

JO ll T fP A ? h ATT CTA AGA GAT ACA TTA GAT AAA ATA TCA GAT 18 72 

W Phe Thr Thr Asn He Leu Arg Lys Asp Thr Leu Asp Lys He Ser Asp 

610 615 620 

GTA TCA GCT ATT ATT CCC TAT ATA GGA CCC GCA TTA AAT ATA AGT AAT 
Val Ser Ala He He Pro Tyr lie Gly Pro Ala Leu Asn He Ser Asn 
4 ~ 525 630 6 35 6 40 

TCT GTA AGA AGA GGA AAT TTT ACT GAA GCA TTT GCA GTT ACT GGT GTA 
Ser Val Arg Arg Gly Asn Phe Thr Glu Ala Phe Ala Val Thr Gly Val 
64 5 650 655 

ACT ATT TTA TTA GAA GCA TTT CCT GAA TTT ACA ATA CCT GCA CTT GGT 
Thr He Leu Leu Glu Ala Phe Pro Glu Phe Thr He Pro Ala Leu Glv 
660 665 670 

GCA TTT GTG ATT TAT AGT AAG GTT CAA GAA AGA AAC GAG ATT ATT AAA 
Ala Phe Val He Tyr Ser Lys Val Gin Glu Arg Asn Glu He He Lvs 
675 680 685 



1920 



1968 



2016 



2064 



AO ^ T tT A ? AT MT TGT TTA GAA CAA AGG ATT AM AGA T GG AAA GAT TCA 211^ 

0» Thr lie Asp Asn Cys Leu Glu Gin Arg He Lys Arg Trp Lys Asp Ser 

690 * 695 700 



:i6o 



2208 
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30 



35 



65 



70 



GCA ATC AAA GCT AAA ATA GAT TTA GAA TAT AAA AAA TAT TCA GGA ACT 
Ala lie Lys Ala Lys He Asp Leu Glu Tyr Lys Lys Tyr Ser G^y s2r 



2256 



750 



GAT AAA GAA AAT ATA AAA AGT CAA GTT GAA AAT TTA AAA AAT APT m 
LyS Asn Ile ^ Gin Val Glu Asn Si Lys £n Ser Si 



-o Sp £? £ ?S S SJ s E Asn IE i2 12 - s K 2J 

775 780 

Si S£ S SI? f£ J}! ™ "* J** MT ATG "A CCT AAA GTA ATT 
15 785 l££ LSU PhC Aon Met Le " Lvs Val He 

790 7 " • aoo 

GAT GAA TTA AAT GAG TTT GAT CGA AAT ACT AAA GCA AAA TTA ATT aa-r 
Glu Leu Asn «« Asp Arg Asn Thr Eyt £i f£ Si ?S 
20 805 81 ° 815 



l-u Val Phe Thr Leu Lys Oln A^n Glu £ P Ser G?u G™ sTr SJ 
980 985 

TTT AGT TAT GAT ATA TCA AAT AAT GCT CCT GGA TAC AAT AAA TGG TTT 
Phe Ser Tyr Asp lie Ser Asn jjnAla Pro Gly Tyr jin JJi IT P HI 

III tk T GT 7 ACT AAC ^ ATG ATG GG A AAT ATG AAG ATT TAT ATA 

Phe Val Thr Val Thr Asn Asn Met Met Gly Asn Met Lys iS lyl i™ 

- 334 - 



2352 



2400 



-p a: „ £ ^ s; is S £ S K 35 S £ 2 " ! 

8X0 81 

s s; s si £ ?n s a sr a; s xs k k 
s s k s? 2 s s s sj is s js s S e ;n 

840 845 

Phi I° A I AT ACT AAT AAT TCT TTA TTA AAA GAT ATA ATT AAT GAA TAT 
Phe ser Tyr Thr Asn Asn Ser Leu Leu Lys Asp Ili nl ^ £J ™J 

855 860 

s s E m s s s t, m lis s s s; 2 s s 

S £ SS S S S £ £ ~ s; -5 s « S 

40 885 8 90 895 

GGC GAT GTT CAG CTT AAT CCA ATA TTT CCA TTT GAC TTT AAA TTA GCT 
Gly Asp val Gin Leu Asn Pro He Phe Pro Phe Asp Phe Si £J 

900 905 9 i 0 

45 Ser Si SS A^ f A n? T *** GTT ATA GTA ACC GAG AAT GAA AAT 

Ser Ser Gly Glu Asp Arg Gly Lys Val He Val Thr Gin Asn Glu Asn 

920 g 25 

» E S £1 ^ IS SI? £ s; S £1 « - - - « - 

935 

Ari tT A ^ ?** I GG GTA AGT AAT CC T GGA TAT ACT ATA ATT GAT 

55 94I Ue ASn LyS TrP 5f J Ser Asn Gly Tyr Thr lli iS ts P 

Ser SIT ^ AAT ^ C I CA GGT TGG AGT ATA GGT ATT ATT AGT AAT TTT 
Ser val Lys Asn Asn Ser Gly Trp Ser He Gly He He Ser Asn He 

60 965 970 97 5 

Si Sit III 5£ ™ ^ ^ ^1 GAA GAT AG - G AA CAA AGT ATA AAT 



2592 



2640 



2688 



2736 



2880 



2928 



2976 



3024 



3072 
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1010 1015 



1020 



3120 



3168 



£n i£ lit Ue Asp {£ ill Ts ST r ^ CTA ACT GGA ATT 
1025 in?n Y 31 LyS Glu Leu Thr Gl Y He 

1030 1035 1040 

HI SI £ £ S JS S SI SI 5S £ ?II 52 £ £ 
E ill £S 2J E S Hj S°HI !I? 5» ~ J" 
SI £ SE E £ SS K JS I» £ IE HIS US 

1080 1085 
TTT AAT AGC TTG CAA TAT ACT AAT ptt r-r» 

Phe Asn Ser Leu Gin Tyr Thr A^n SI? SI? GAT ™ T TGG GGA ™ T 
1090 y 1095 31 LyS Asp Tyr Tr P G1 V Asn 

S III S ™I HI £ SI £ s ™ -i « K HI # HI 

1110 1"5 1120 

A^n a™ m TG TAT GCG AAC TCA CGA CA * ATT GTT TTT AAT ACA CGT 

A,n Aig Tyr Met Tyr Ala Asn Ser Ar 9 Gin He Val PhT Thr Arg 

1130 

JS HI HI HI S< SI £ Si SjS HI SI S S 

hi s hi s hi hi s is «? ™ « si s 

™ SI HI SI? S K HI - ™ S « - ™ S S «. « 

1175 1180 

Thr Mec lyl Ma Asl f T ACT GAA GAT A ™ ™T GCT 

1185 ^ P ^ HlS Thr G1U AS P lle Tyr Ala 

1195 1200 



3216 



3264 



3312 



3360 



3408 



3456 



3552 



3600 



s ss ei is si si s e si - « - - - s K .... 

1210 -i 

SI IK SI SS HI HI S £ ™? - ™ IH SK S2 »« 



is s se hi s si is in hi sj s: s i» ™ sj »« 

S £S SI IS E £SIHI S # IS S HI ?II us »» 

1255 1260 

SI? P C r C o T val ^ £1 t™ S £ 27* 2™ T T ° A ACA 3840 

126 5 7?™ y Ser Leu Leu Glu Ser Thr 

1270 12'5 1280 

TCA ACT CAT TGG GGT TTT GTA CCT GTA AGT GAA TAA 

Ser Thr His Trp Gly Phe Val Pro Val Ser Glu 3876 
l 2 " 1290 
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(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 91 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:60: 

Met Pro He Thr lie Asn Asn Phe Asn Tyr Ser Asp Pro Val Asp Asn 
1 5 io 



15 



Lys Asn He Leu Tyr Leu Asp Thr His Leu Asn Thr Leu Ala Asn Glu 
20 25 30 

Pro Glu Lys Ala Phe Arg He Thr Gly Asn He Trp Val He Pro Asp 
35 40 - - ^ 



45 



Arg Phe Ser Arg Asn Ser Asn Pro Asn Leu Asn Lys Pro Pro Arq val 
50 55 60 

->c T ^ Ser Pro L V S Ser G1 V Tyr Tyr Asp Pro Asn Tyr Leu Ser Thr Asp 

65 70 75 8 £ 

Ser Asp Lys Asp Thr Phe Leu Lys Glu He lie Lys Leu Phe Lys Arg 
85 90 y5 

30 u e Asn Ser Arg Glu He Gly Glu Glu Leu lie Tyr Arg Leu Ser Thr 

100 105 110 

Asp lie Pro Phe Pro Gly Asn Asn Asn Thr Pro He Asn Thr Phe Asp 
115 120 125 

Phe Asp Val Asp Phe Asn Ser Val Asp Val Lys Thr Arg Gin Gly Asn 
130 135 140 

Asn Trp val Lys Thr Gly Ser He Asn Pro Ser Val lie He Thr Gly 

40 145 150 155 ls l 

Pro Arg Glu Asn He He Asp Pro Glu Thr Ser Thr Phe Lys Leu Thr 
165 170 175 



Asn Asn Thr Phe Ala Ala Gin Glu Gly Phe Gly Ala Leu Ser lie lie 

180 185 190 

Ser lie Ser Pro Arg Phe Met Leu Thr Tyr Ser Asn Ala Thr Asn Asp 

195 200 



205. 



Val Gly Glu Gly Arg Phe Ser Lys Ser Glu Phe Cys Met Asp Pro lie 
210 215 220 

Leu lie Leu Met His Glu Leu Asn His Ala Met His Asn Leu Tyr Gly 

^ 225 230 235 Y 240 

He Ala lie Pro Asn Asp Gin Thr He Ser Ser Val Thr Ser Asn lie 
245 250 255 

Phe Tyr Ser Gin Tyr Asn Val Lys Leu Glu Tyr Ala Glu He Tyr Ala 
260 265 



270 



Phe Gly Gly Pro Thr He Asp Leu He Pro Lys Ser Ala Arg Lys Tvr 
275 280 285 

Phe Glu Glu Lys Ala Leu Asp Tyr Tyr Arg Ser He Ala Lys Arg Leu 

^ 90 295 300 

Asn Ser He Thr Thr Ala Asn Pro Ser Ser Phe Asn Lys Tyr He Gly 
305 310 315 320 

- 336 - 



WO 98/08540 



PCT/US97/15394 



10 



15 



20 



30 



40 



45 



50 



60 



65 



70 



Glu Tyr Lys Gin Lys Leu He Arg Lys Tyr Arg Phe Val Val Glu Ser 
325 330 335 

Ser Gly Glu Val Thr Val Asn Arg Asn Lys Phe Val Glu Leu Tyr Asn 
340 345 350 

Glu Leu Thr Gin He Phe Thr Glu Phe Asn Tyr Ala Lys He Tyr Asn 
355 360 365 

Val Gin Asn Arg Lys lie Tyr Leu Ser Asn Val Tyr Thr Pro Val Thr 
370 375 380 

Ala Asn He Leu Asp Asp Asn Va.l Tyr Asp He Gin Asn Gly Phe Asn 
385 390 395 400 

He Pro Lys Ser Asn Leu Asn Val Leu Phe Met Gly Gin Asn Leu Ser 
405 410 4i 5 

Arg Asn Pro Ala Leu Arg Lys Val Asn Pro Glu Asn Met Leu Tyr Leu 
420 425 430 

Phe Thr Lys Phe Cys His Lys Ala He Asp Gly Arg Ser Leu Tyr Asn 
435 440 ' 445 

Lys Thr Leu Asp Cys Arg Glu Leu Leu Val Lys Asn Thr Asp Leu Pro 
450 455 460 

Phe Lie Gly Asp He Ser Asp Val Lys Thr Asp lie Phe Leu Arq Lys 
465 470 475 480 

Asp He Asn Glu Glu Thr Glu Val He Tyr Tyr Pro Asp Asn Val Ser 
485 490 495 

Val Asp Gin Val He Leu Ser Lys Asn Thr Ser Glu His Gly Gin Leu 
500 505 510 

Asp Leu Leu Tyr Pro Ser He Asp Ser Glu Ser Glu He Leu Pro Gly 
515 520 525 

Glu Asn Gin Val Phe Tyr Asp Asn Arg Thr Gin Asn Val Asp Tyr Leu 
530 535 540 

Asn ser Tyr Tyr Tyr Leu Glu Ser Gin Lys Leu Ser Asp Asn Val Glu 
545 550 555 560 

Asp Phe Thr Phe Thr Arg Ser He Glu Glu Ala Leu Asp Asn Ser Ala 
565 570 575 

Lys Val Tyr Thr Tyr Phe Pro Thr Leu Ala Asn Lys Val Asn Ala Gly 
580 585 590 

Val Gin Gly Gly Leu Phe Leu Met Trp Ala Asn Asp Val Val Glu Asp 
595 600 605 

Phe Thr Thr Asn He Leu Arg Lys Asp Thr Leu Asp Lys He Ser Asp 
610 615 620 

Val Ser Ala He He Pro Tyr He Gly Pro Ala Leu Asn He Ser Asn 
6 ^ 5 630 635 640 

Ser Val Arg Arg Gly Asn Phe Thr Glu Ala Phe Ala Val Thr Gly Val 
645 650 655 

Thr lie Leu Leu Glu Ala Phe Pro Glu Phe Thr He Pro Ala Leu Gly 
660 665 670 

Ala Phe Val He Tyr Ser Lys Val Gin Glu Arg Asn Glu lie lie Lys 
675 680 685 

Thr He Asp Asn Cys Leu Glu Gin Arg He Lys Arg Trp Lys Asp Ser 
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WO 98/08540 



PCT/US97/15394 



690 695 



700 
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15 



40 



45 



50 



Tyr Glu Trp Met Met Gly Thr Trp Leu Ser Arg lie lie Thr Gin Phe 

5 710 715 720 

Asn Asn lie Ser Tyr Gin Met Tyr Asp Ser Leu Asn Tyr Gin Ala Gly 
725 730 735 

Ala He Lys Ala Lys lie Asp Leu Glu Tyr Lys Lys Tyr Ser Glv Ser 

740 745 750 

Asp Lys Glu Asn lie Lys Ser Gin- Val Glu Asn Leu Lys Asn Ser Leu 

755 760 765 

Asp Val Lys lie Ser Glu Ala Met Asn Asn He Asn Lys Phe lie Arg 



780 



Glu Cys Ser Val Thr Tyr Leu Phe Lys Asn Met Leu Pro Lys Val He 
20 790 79 * 800 

Asp Glu Leu Asn Glu Phe Asp Arg Asn Thr Lys Ala Lys Leu lie Asn 
805 810 81S 

Leu lie Asp Ser His Asn lie lie Leu Val Gly Glu Val Asp Lys Leu 
8 *° 825 

Lys Ala Lys Val Asn Asn Ser Phe Gin Asn Thr lie Pro Phe Asn lie 
835 840 845 

30 Phe Ser Tyr Thr Asn Asn Ser Leu Leu Lys Asp He lie Asn Glu Tyr 
850 855 aeo Y 

Phe Asn Asn He Asn Asp Ser Lys He Leu Ser Leu Gin Asn Arg Lvs 
865 870 875 880 

Asn Thr Leu Val Asp Thr Ser Gly Tyr Asn Ala Glu Val Ser Glu Glu 
885 890 895 

Gly Asp Val Gin Leu Asn Pro He Phe Pro Phe Asp Phe Lys Leu Glv 
900 905 910 

Ser Ser Gly Glu Asp Arg Gly Lys Val He Val Thr Gin Asn Glu Asn 
915 920 925 

He Val Tyr Asn Ser Met Tyr Glu Ser Phe Ser He Ser Phe Trp He 
930 935 940 

Arg lie Asn Lys Trp Val Ser Asn Leu Pro Gly Tyr Thr He lie Asp 
945 950 955 960 

Ser Val Lys Asn Asn Ser Gly Trp Ser He Gly He lie Ser Asn Phe 
965 970 975 

Leu Val Phe Thr Leu Lys Gin Asn Glu Asp Ser Glu Gin Ser lie Asn 
" J 980 985 g90 

Phe Ser Tyr Asp He Ser Asn Asn Ala Pro Gly Tyr Asn Lvs Trp Phe 
995 1000 1005 ' 

60 Phe Val Thr Val Thr Asn Asn Met Met Gly Asn Met Lys He Tyr He 
1010 1015 1020 

Asn Gly Lys Leu He Asp Thr He Lys Val Lys Glu Leu Thr Gly He 
65 1025 1030 1035 10 40 

Asn Phe Ser Lys Thr lie Thr Phe Glu He Asn Lys He Pro Asp Thr 
1045 1050 1055 

Gly Leu He Thr Ser Asp Ser Asp Asn He Asn Met Trp He Arq Asp 
70 1060 1065 ^ 1070 
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Phe Tyr He Phe Ala Lys Glu Leu Asp Gly Lys Asp He Asn He Leu 
1075 1080 1085 

Phe Asn Ser Leu Gin Tyr Thr Asn Val Val Lys Asp Tyr Trp Glv Asn 
1090 1095 iioo 

Asp Leu Arg Tyr Asn Lys Glu Tyr Tyr Met Val Asn He Asp Tyr Leu 
H°5 H10 ins ino 

Asn Arg Tyr Met Tyr Ala Asn Ser Arg Gin He Val Phe Asn Thr Arg 
H25 H30 1135 

Arg Asn Asn Asn Asp Phe Asn Glu Gly Tyr Lys He He He Lys Arq 
H40 H45 iiso 

He Arg Gly Asn Thr Asn Asp Thr Arg Val Arg Gly Gly Asp He Leu 
1155 1160 lies 

Tyr Phe Asp Met Thr He Asn Asn Lys Ala Tyr Asn Leu Phe Met Lvs 
H* 70 1175 H80 

Asn Glu Thr Met Tyr Ala Asp Asn His Ser Thr Glu Asp He Tvr Ala 
1185 1190 1195 1200 

He Gly Leu Arg Glu Gin Thr Lys Asp He Asn Asp Asn He lie Phe 
1205 1210 1215 

Gin lie Gin Pro Met Asn Asn Thr Tyr Tyr Tyr Ala Ser Gin lie Phe 
1220 1225 1230 

Lys Ser Asn Phe Asn Gly Glu Asn He Ser Gly He Cys Ser He Gly 
1235 1240 1245 

Thr Tyr Arg Phe Arg Leu Gly Gly Asp Trp Tyr Arg His Asn Tyr Leu 
J - 1250 1255 1260 



30 



40 



50 



60 
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70 



Val Pro Thr Val Lys Gin Gly Asn Tyr Ala Ser Leu Leu Glu Ser Thr 
1265 1270 1275 1280 

Ser Thr His Trp Gly Phe Val Pro Val Ser Glu 
1285 1290 



(2) INFORMATION FOR SEQ ID NO: 61: 

^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(DJ TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY : CDS 
^ (B) LOCATION: 108.. 1493 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 61: 

AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 6 0 

TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT U 6 

Met Gly His 
1 

CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT 164 
His His His His His His His His His Ser Ser Gly His He Glu Glv 
5 10 is Y 

CGT CAT ATG GCT AGC ATG GCT TTA TTA AAA GAT ATA ATT AAT GAA TAT -12 
Arg His Met Ala Ser Met Ala Leu Leu Lys Asp He He Asn Glu Tyr 
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20 25 

55 2 IE K S £ S £ 31 £S J* S S! i£ £ K 



45 



50 



IS S K S| s; S S £ S !E S SK SI? K SI Si 

10 55 eo 65 



90 95 



ATT GTA TAT AAT TCT ATG TAT GAA AGT TTT AGC ATT APT TTT -mn 
lie val Tyr Asn Ser Met Tyr ciu Ser Phe Ser ill 5£ ^ ™ {J 



105 



110 



lie Irp S2 ^ f T 7™ CCT GGA ™ T ACT ATA ATT GAT 

S tie Asn Lys Trp Val Sei Asn Leu Pro Gly Tyr Thr lie lie Asp 

1<.0 12 5 * 



130 



190 195 



AAT GGA AAA TTA ATA GAT ACT ATA AAA GTT AAA GAA CTA ACT CPA «tt 
4 . Asn Gly Lys Leu lie Asp Thr lie Lys Val Lys Glu Leu Thr Gly ill 



205 



210 



AAT TTT AGC AAA ACT ATA ACA TTT GAA ATA AAT AAA ATT CCA CAT APP 

Aon Phe Ser Lys Thr lie Thr Phe Glu lie £n Lys He Pro AsJ Thr 
50 ' 15 220 225 

G G y Uu ill Thr ler %1 TJ S" ^ ^ ™ ™ TGG ATA AGA GAT 

y Leu lie Thr Ser Asp Ser Asp Asn He Asn Met Trp He Arg Asp 



240 



Phe Tyr lie EI Jyt £J leu T GAT ATT AAT ATA ™ 

> e *ne Ala Lys Glu Leu Asp Gly Lyo Asp lie Asn lie Leu 

^50 255 



-j. s „ Leu „„ , Thr z» si «r - si ^ s; sj £ 

270 



275 



GAT TTA AGA TAT AAT AAA GAA TAT TAT ATG GTT AAT ATA GAT TAT TTA 
Asp Leu Arg Tyr Asn Lys Glu Tyr Tyr Met Val Asn lie AsJ lyl lIu 

285 290 

AAT AGA TAT ATG TAT GCG AAC TCA CGA CAA ATT GTT TTT AAT ACA CCT 
Asn Arg Tyr Met Tyr Ala Asn Ser Arg Gin lie Val III ™l tS Arg 

70 95 300 305 



260 



308 



356 



oly tsl SI Gin 23 f T S CA A I A TTT ° CA TTT GAC TTT AAA TTA GGT 
Gly A„p Val Gin Leu Asn Pro He Phe Pro Phe Asp Phe Lys L.eu Gly 

75 80 

AGT TCA GGG GAG GAT AGA GGT AAA GTT ATA GTA ACC CAP AAT pa a »,,. 
Ser Ser Gly Glu Asp Arg Gly Lys Val He ?2 Thr G^n £n £J £1 



452 



500 



548 



£ S £ % E S S E S SJ IS K s E S 
K S JE S E £ Si i£ Si J" S Si Si K S iE '« 

Phi Sel ICI A A I n A I" ^ ^ T ° CT CCT GGA TAC AAT AAA TGG TTT 
Phe S.i Tyr Asp lie Ser Asn Asn Ala Pro Gly Tyr Asn Lys Trp Phe 

170 175 

40 JE S £ £ i£ z s ss sj - «° - j« »; ™ .» 

18 5 - *■* — 



644 



740 



788 



836 



864 



fi» £1 HZ II G ™ A " ™ G TT GTA ™T TAT TGG GGA AAT 



980 



1028 
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a™ ^ ^ ^ T GAC TTC AAT GAA GGA TAT ATT ATA ATA AAA AGA 

Arg Asn Asn Asn Asp Phe Asn Glu Gly Tyr Lys lie He He Lys Arg 

310 315 320 

ATC AGA GGA AAT ACA AAT GAT ACT AGA GTA CGA GGA GGA GAT ATT TTA 
He Arg Gly Asn Thr Asn Asp Thr Arg Val Arg Gly Gly Asp lie Leu 

330 335 



TAT TTT GAT ATG ACA ATT AAT AAC AAA GCA TAT AAT TTG TTT ATG AAG 

Tyr Phe Asp Met Thr lie Asn Asn Lys Ala Tyr Asn Leu Phe Met lCs 
340 345 350 355 

A '} T ^ ^u T ATG TAT GCA GAT ^ CAT AGT A CT GAA GAT ATA TAT GCT 

asn Giu Thr Met Tyr Ala Asp Asn His Ser Thr Glu Asp He Tyr Ala 



370 



«TA GGT TTA AGA GAA CAA ACA AAG GAT ATA AAT GAT AAT ATT ATA TTT 
He Gly Leu Arg Glu Gin Thr Lys Asp lie Asn Asp Asn lie He Phi 
375 380 3 8 5 

CAA ATA CAA CCA ATG AAT AAT ACT TAT TAT TAC GCA TCT CAA ATA TTT 
Gin He Gin Pro Met Asn Asn Thr Tyr Tyr Tyr Ala sir Gin He SI 

jyo 395 



f ™ T A ^ T V T AAT GGA GAA ATT TCT GGA ATA TGT TCA ATA GGT 

Lys Ser Asn Phe Asn Gly Glu Asn He Ser Gly He Cvs Ser lie Gil 



415 



'i) SEQUENCE CHARACTERISTICS: 
k (AJ LENGTH : 462 amino acids 

4 * (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

M) <*i> SEQUENCE DESCRIPTION: SEQ ID NO:62: 

Met Gly His His His His His His His His His His Ser Ser Gly His 
5 10 15 

X> Me Glu Gly Arg His Met Ala Ser Met Ala Leu Leu Lys Asp lie He 

20 25 30 

Asn Glu Tyr Phe Asn Asn He Asn Asp Ser Lys He Leu Ser Leu Gin 
60 35 40 45 

Asn Arg Lys Asn Thr Leu Val Asp Thr Ser Gly Tyr Asn Ala Glu Val 

55 60 

Ser Glu Glu Gly Asp Val Gin Leu Asn Pro lie Phe Pro Phe Asp Phe 

70 75 eo 

Lys Leu Gly Ser Ser Gly Glu Asp Arg Gly Lys Val He Val Thr Gin 

85 90 95 

Asn Glu Asn He Val Tyr Asn Ser Met Tyr Glu Ser Phe Ser He Ser 

- 341 - 



1076 



1124 



1172 



1220 



1268 



1316 



1364 



M) Thr Z*Z ll T AGA CTT GGA GGT GAT TGG TAT AGA CAC **T TAT TTG 1412 

Ihr Tyr Arg Phe Arg Leu Gly Gly Asp Trp Tyr Arg His Asn Tyr Leu 

** aU 425 430 435 



vll Pro Thr SI? ^ ^ GCT TCA T ™ TTA GAA TGA AGA 1460 

val Pio Thr Val Lys Gin Gly Asn Tyr Ala Ser Leu Leu Glu spr Thr 

440 445 

Ser ?h! G ? T T u T GTA CCT GTA AGT GAA TAAAAGCTT 1502 

Ser Ihr His Trp Gly Phe Val Pro Val Ser Glu 
455 460 

(21 INFORMATION FOR SEQ ID NO: 62: 



100 105 



110 



Phe Trp lie Arg lie Asa Lys Trp Val Ser Asn Leu Pro Gly Tyr Thr 
115 120 125 

He lie Asp Ser Val Lys Asn Asn Ser Gly Trp Ser lie Gly lie lie 
J0 135 140 

Ser Asn Phe Leu Val Phe Thr Leu Lys Gin Asn Glu Asp Ser Glu Gin 

150 155 160 

Ser He Asn Phe Ser Tyr Asp lie Ser Asn Asn Ala Pro Gly Tyr Asn 
165 170 

Lys Trp Phe Phe Val Thr Val Thr Asn Asn Met Met Glv Asn Met Lys 
180 185 3 



190 



He Tyr lie Asn Gly Lys Leu lie Asp Thr He Lys Val Lys Glu Leu 
iy => 200 205 

Thr Gly He Asn Phe Ser Lys Thr He Thr Phe Glu He Asn Lys He 



220 



Pro Asp Thr Gly Leu lie Thr Ser Asp Ser Asp Asn lie Asn Met Trp 
230 235 2 4o 

He Arg Asp Phe Tyr He Phe Ala Lys Glu Leu Asp Gly Lys Asp lie 
245 250 255 

Asn He Leu Phe Asn Ser Leu Gin Tyr Thr Asn Val Val Lys Asp Tyr 
*-&0 265 270 

Trp Gly Asn Asp Leu Arg Tyr Asn Lys Glu Tyr Tvr Met Val Asn lie 
27b 280 ' 285 

Asp Tyr Leu Asn Arg Tyr Met Tyr Ala Asn Ser Arg Gin lie Val Phe 
290 295 300 

Asn Thr Arg Arg Asn Asn Asn Asp Phe Asn Glu Gly Tyr Lys lie lie 
° S 310 31S 32Q 

He Lys Arg lie Arg Gly Asn Thr Asn Asp Thr Arg Val Arg Gly Glv 
325 330 33^ y 

Asp He Leu Tyr Phe Asp Met Thr He Asn Asn Lys Ala Tyr Asn Leu 
340 345 35o 

Phe Met Lys Asn Glu Thr Met Tyr Ala Asp Asn His Ser Thr Glu Asp 
355 3 6 o 365 ^ 

He Tyr Ala lie Gly Leu Arg Glu Gin Thr Lys Asp lie Asn Asp Asn 

j/u 375 



380 



lie lie Phe Gin He Gin Pro Met Asn Asn Thr Tyr Tyr Tyr Ala Ser 
385 390 395 400 

Gin He Phe Lys Ser Asn Phe Asn Gly Glu Asn He Ser Gly He Cvs 
4 °5 410 415 

Ser He Gly Thr Tyr Arg Phe Arg Leu Gly Gly Asp Trp Tyr Arq His 
420 4 25 430 

Asn Tyr Leu Val Pro Thr Val Lys Gin Gly Asn Tyr Ala Ser Leu Leu 
435 440 445 

Glu Ser Thr Ser Thr His Trp Gly Phe Val Pro Val Ser Glu 
450 455 460 

(2) INFORMATION FOR SEQ ID NO: 63: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 2 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : Single 
(D) TOPOLOGY: linear 



10 



15 



40 



60 



65 



<ii> MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 

CGCCATGGCT TTATTAAAAG ATATAATTAA TG 

<2) INFORMATION FOR SEQ ID NO: 64: 



CCT GTA AAA GCT TTT ATG ATT ACT CAA AAT ATT TGG GTA ATA CCA GAA 
Pro val Lys Ala Phe Met lie Thr Gin Asn lie Trp Val He Pro Glu 
35 40 45 

AGA TTT TCA TCA GAT ACT AAT CCA AGT TTA AGT AAA CCG CCC AGA CCT 
Arg Phe Ser Ser Asp Thr Asn Pro Ser Leu Ser Lys Pro Pro Arq Pro 
50 55 60 

ACT TCA AAG TAT CAA AGT TAT TAT GAT CCT AGT TAT TTA TCT ACT GAT 
Thr Ser Lys Tyr Gin Ser Tyr Tyr Asp Pro Ser Tyr Leu Ser Thr Asp 

65 70 75 8 5 

GAA CAA AAA GAT ACA TTT TTA AAA GGG ATT ATA AAA TTA TTT AAA AGA 
Glu Gin Lys Asp Thr Phe Leu Lys Gly He He Lys Leu Phe Lys Arq 
85 90 95 



32 



!i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 
fC) STRANDEDNESS : single 

- u <D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

<*i> SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

GCAAGCTTTT ATTCACTTAC AGGTACAAAA CC 

(2) INFORMATION FOR SEQ ID NO:65- 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 3831 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS : double 

•° (D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

{A) NAME /KEY: CDS 
(B) LOCATION: 1 . . 3828 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65: 

4> ATG ACA TGG CCA GTA AAA GAT TTT AAT TAT AGT GAT CCT GTT AAT GA<" 

Met Tnr Trp Pro Val Lys Asp Phe Asn Tyr Ser Asp Pro Val Asn Asp 
5 10 15 

AAT GAT ATA TTA TAT TTA AGA ATA CCA CAA AAT AAG TTA ATT ACT ACA 96 
Asn Asp lie Leu Tyr Leu Arg He Pro Gin Asn Lys Leu He Thr Thr 
20 25 30 



48 



144 



192 



240 



288 



ATT AAT GAA AGA GAT ATA GGA AAA AAA TTA ATA AAT TAT TTA GTA GTT 116 
/<> lie Asn Glu Arg Asp lie Gly Lys Lvs Leu He Asn Tyr Leu Val Val 
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100 105 



110 



GGT TCA CCT TTT ATG GGA GAT TCA AGT ACG CCT GAA GAT ACA TTT GAT 
5 Gly Ser Pro Phe Met Gly Asp Ser Ser Thr Pro AsJ Thr HI Asp 

120 125 

TTT ACA CGT CAT ACT ACT AAT ATT GCA GTT GAA AAG TTT GAA AAT GGT 
Phe Thr Arg His Thr Thr Asn lie Ala Val Glu Lys III JJJ £J 
10 135 140 

AGT TGG AAA GTA ACA AAT ATT ATA ACA CCA AGT GTA TTG ATA TTT CPA 
Ser Trp Ly S Val Thr Asn lie lie Thr Pro Ser Val ™ S SI Oly 

150 "5 i6 Y 0 



\5 



CCA CTT CCT AAT ATA TTA GAC TAT ACA GCA TCC CTT ACA TTG CAA GGA 
Pro Leu Pro Asn lie Leu Asp Tyr Thr Ala Ser Leu ?£ lIu SJ 

165 170 ' 



175 



230 235 



lit £1 tT A E CA TCT GAT *** AGG ATT CGT CCA CAA GTT AGC GAG GGA 
He Asn lie Pro Ser Asp Lys Arg lie Arg Pro Gin Val Ser Glu Gly 

245 250 255 



384 



43; 



480 



528 



">() JS* TCA MT CCA TCA TTT GAA GG<3 TTT GGA ACA TTA TCT ATA CTA 

-0 Gin Cln ser Asn Pro Ser Phe Glu Gly Phe Gly Thi llu III Ue Lei "« 

AAA GTA GCA CCT GAA TTT TTG TTA ACA TTT AGT GAT GTA ACA TCT AAT 

Lys Val Ala Pro Glu Phe Leu Leu Thr Phe Ser Asp Val ?£ HI £1 624 

CAA AGT TCA OCT GTA TTA GGC AAA TCT ATA TTT TGT ATG GAT CCA GTA 
Cln ser Ser Ala Val Leu Gly Lys Ser lie Phe Cys Met Asp Pro vll 
;,0 215 220 

ATA GCT TTA ATG CAT GAG TTA ACA CAT TCT TTG CAT CAA TTA TAT GG* 
Ue Ala Leu Met His Glu Leu Thr His Ser Leu 55 Gin llu ™ «y 

240 



672 



720 



768 



40 III HI L C I GA Z G ? A ^ ™ ?3 C ™ T5 T GAG GAA TTA TAT ACA 



Phe Phe Ser Gin Asp Gly P~ro A^n Va? Gin Phe G?u Glu Leu Tyl Si 
260 265 270 

III n? A TTA GAT GTT GAA ATA ATA CCT CAA ATT GAA AGA TCA CAA 

Phe Cly Gly Leu Asp Val Glu lie He Pro Gin lie Glu Arg Ser Gin 
" 75 280 285 

H A AGA GAA *** GCA TTA GGT CAC TAT AAA GAT ATA GCG AAA AGA CTT 
Leu Arg Glu Lys Ala Leu Gly His Tyr Lys Asp Ue Ala Lys Arg ill 
5() ~* U 295 300 

AAT AAT ATT AAT AAA ACT ATT CCT TCT AGT TGG ATT AGT AAT ATA GAT 
Asn Asn lie Asn Lys Thr lie Pro Ser Ser Trp lie Ser Asn lie AsJ 

310 315 3 20 

AAA TAT AAA AAA ATA TTT TCT GAA AAG TAT AAT TTT GAT AAA GAT AAT 
Lys Tyr Lys Lys lie Phe Ser Glu Lys Tyr Asn Phe Asp lys lip tel 
325 330 33 | 

M) A TTT GTT GTA AAT ATT GAT AAA TTC AAT AGC TTA TAT TCA ,„« 

™ Thr Gly Asn Phe Val val Asn He Asp Lys Phe Asn Ser llu Tyr sir 
340 345 350 



864 



912 



960 



1008 
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GAC TTG ACT AAT GTT ATG TCA GAA GTT GTT TAT TCT TCG CAA TAT AAT 
Asp Leu Thr Asn val Met Ser Glu Val Val Tyr Ser Ser Gin Tvr Asn 
355 360 365 

GTT AAA AAC AGG ACT CAT TAT TTT TCA AGG CAT TAT CTA CCT GTA TTT 
val Lys Asn Arg Thr His Tyr Phe Ser Arg Hie Tyr Leu Pro Val Phe 
370 375 38o 

GCA AAT ATA TTA GAT GAT AAT ATT TAT ACT ATA AGA GAT GGT TTT AAT 
l» Hxa Asn lie Leu Asp Asp Asn lie Tyr Thr lie Arg Asp Gly Phe Asn 
385 390 395 400 

TTA ACA AAT AAA GGT TTT AAT ATA GAA AAT TCG GGT CAG AAT ATA GAA 
Leu Thr Asn Lys Gly Phe Asn lie Glu Asn Ser Gly Gin Asn lie Glu* 
405 410 415 

AGG AAT CCT GCA CTA CAA AAG CTT AGT TCA GAA AGT GTA GTA GAT TTA 
Arg Asn Pro Ala Leu Gin Lys Leu Ser Ser Glu Ser Val Val Asp Leu 
2() 420 425 430 

Phi Th? l GT T A AGA TTA ACA ^ T AGT AGA GAT GAT TCA 

Phe Thr Lys Val Cys Leu Arg Leu Thr Lys Asn Ser Arg Asp Asp Ser 
435 440 44 | ^ 

2> ACA TGT ATT AAA GTT AAA AAT AAT AGA TTA CCT TAT GTA GCT GAT AAA 

Thr Cys lie Lys Val Lys Asn Asn Arg Leu Pro Tyr Val Ala Asp Lys 
450 455 46O 

V) ^ T A T T TCA CAA GAA ATA TTT GAA AAT AAA ATT ATT ACA GAT GAG 

Asp Ser He Ser Gin Glu lie Phe Glu Asn Lys lie He Thr Asp Glu 
465 470 475 480 

ACT AAT GTA CAA AAT TAT TCA GAT AAT TTT TCA TTA GAT GAA TCT ATT 
Inr Asn Val Gin Asn Tyr Ser Asp Asn Phe Ser Leu Asp CUu Ser lie 
485 490 4 95 

TTA GAT GGG CAA GTT CCT ATT AAT CCT GAA ATA GTA GAT CCA CTA TTA 
Leu Asp Gly Gin Val Pro lie Asn Pro Glu lie Val Asp Pro Leu Leu 
4() 500 505 510 

CCC AAT GTT AAT ATG GAA CCT TTA AAT CTT CCA GGT GAA GAA ATA GTA 
Pro Asn Val Asn Met Glu Pro Leu Asn Leu Pro Gly Glu Glu He Val 
515 520 525 

4^ TTT TAT GAT GAT ^ tat TTA AAT TCT TAT TAT 

Phe Tyr Asp Asp lie Thr Lys Tyr Val Asp Tyr Leu Asn Ser Tyr T Vr 
530 535 540 Y *' 



65 



ACA ACT TCA GTT GAA GAA GCA TTA GGT TAT AGC AAT AAG ATA TAC ACA 
Thr Thr Ser Val Glu Glu Ala Leu Gly Tyr Ser Asn Lys lie Tyr Thr 
565 570 575 

TTT TTA CCT AGC TTA GCT GAA AAA GTG AAT AAA GGT GTT CAA GCA GGT 
Phe Leu Pro Ser Leu Ala Glu Lys Val Asn Lys Gly Val Gin Ala Gly 
60 80 585 590 

TTA TTC TTA AAT TGG GCG AAT GAA GTA GTT GAG GAT TTT ACT ACA AAT 
Leu Phe Leu Asn Trp Ala Asn Glu Val Val Glu Asp Phe Thr Thr Asn 
595 600 605 



ATT ATG AAG AAA GAT ACA TTG GAT AAA ATA TCA GAT GTA TCA GTA ATA 
He Met Lys Lys Asp Thr Leu Asp Lys He Ser Asp Val Ser Val He 
610 615 620 

ATT CCA TAT ATA GGA CCT GCC TTA AAT ATA GGA AAT TCA GCA TTA AGG 
He Pro Tyr He Gly Pro Ala Leu Asn He Gly Asn Ser Ala Leu Arg 

- 345 - 



1104 



1152 



1200 



1248 



1296 



1344 



1392 



1440 



1488 



1536 



1584 



1632 



T„T T TG J** CAA ^ TTA AGT AAT AAT GTT GAA AAT ATT ACT CTT 1680 

Hi Leu Glu Se r Gin Lys Leu Ser Asn Asn Val Glu Asn He Thr Leu 

550 555 560 



1728 



1776 



1824 



1872 
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625 

640 



630 635 



GGA AAT TTT AAG CAA GCA TTT GCA ACA GCT GGT GTA GCT TTT TTA ttu 
Gly Asn Phe Lys Gin Ala Phe Ala Thr Ala Gly vll a" Phe Leu Uu 
645 650 655 



IT CCA GAG ^ ACT ATA CCT GCA CTC GGT GTA TTT ACC TTT 
Glu Gly Phe Pro Glu Phe Thr lie Pro Ala Leu Gly vll IFe Thr Phe 2016 
660 665 670 



AAT GAG AGT TTT GAA AAT ACA ATG CCT TTT AAT ATT TTT TCA TAT ACT 
Asn Glu Ser Phe Glu Asn Thr Met Pro Phe Asn III III Ser Tyr ?£ 
835 840 84 5 

AAT AAT TCT TTA TTA AAA GAT ATA ATT AAT GAA TAT TTC AAT AGT ATT 
Asn A=n ser Leu Leu Lys Asp lie n e Asn Glu Ty ? ™ A^n lie 

855 Q60 



2064 



# s sj s si si an s: ss;^ s s sk si 

SI E SI Si K KT ^ £ S £ 21 S 5 Si S »» 



700 

">0 Sit W ^ I GG TTG TCA AGA ATT ACT ACT *-AA TTT AAT CAT ATA AAT 

-<> *al Se> Asn Trp Leu Ser Arg lie Thr Thr Gin Phe Asn Hil lie Jin 216 ° 

710 7 15 720 

Itl ^ m TG TAT GAT TCT TTA AGT TAT CAG GCA GAT GCA ATC AAA GCT 
2 . Tyr Gin Met Tyr Asp Ser Lou Ser Tyr Gin Ala Asp Ala lie ^ mI 

25 730 735 

AAA ATA GAT TTA GAA TAT AAA AAA TAC TCA GGA AGT GAT AAA GAA AAT 
U<s He Asp Leu Glu Tyr Lys Lys Tyr Ser Gly Ser Asp Lys G^ £J 
30 745 750 

lie ^ ser Val ^ 7™ *** AGT TTA GAT GTA ^ ATT 

ne l>_ sex Gin Val Glu Asn Leu Lys Asn Ser Leu Asp Val Lys U e 

755 7 60 7 6 5 y 

7 ° l CG GAA GCA ATG MT AAT ATA AAT AAA TTT ATA CGA GAA TGT TCT era 

Ser Glu Ala Met Asn Asn He Asn Lys Phe He Arg G^ Cyl lor vll 

77 5 780 

« S S S SI £ S E? S S £ S JH 2S ™ 

790 800 

AAG TTT GAT TTA AGA ACT AAA ACA GAA TTA ATT AAT CTT ATA GAT AGT 
4S Lys Phe Asp Leu Arg Thr Lys Thr Glu Leu He Asn l" lie ts P Ser 

805 810 815 

CAT AAT ATT ATT CTA GTT GGT GAA GTA GAT AGA TTA AAA GCA AAA GTA 
His Asn lie lie Leu Val Gly Glu Val Asp Arg Leu Y^s Ala vll 
50 820 825 830 



2208 



2256 



2304 



2352 



2448 



>496 



2544 



2592 



« iS 2? S £ S IIS £ 21 Si E Si £ E SI S ss »« 

870 875 880 

» AT iu A I CA GGA TAT AAT GCA GAA GTG AGG GTA GGA GAT AAT GTT CAA 
fi . Thr Ser G ^ Tyr Asn Ala Glu Val Arg Val Gly AsJ vll 

885 890 895 

CTT AAT ACG ATA TAT ACA AAT GAC TTT AAA TTA AGT AGT TCA GGA rax 
Leu Asn Thr He Tyr Thr Asn Asp Phe Lys Leu Sel Sel Ter Gly Asp 
70 900 9 °5 910 * 



268B 



2736 
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15 



40 



45 



65 



AAA ATT ATA GTA AAT TTA AAT AAT AAT ATT TTA TAT AGC GCT ATT TAT 
Lys lie lie Val Asn Leu Asn Asn Asn lie Leu Tyr Ser Ala He Tyr 
915 920 925 

GAG AAC TCT AGT GTT AGT TTT TGG ATT AAG ATA TCT AAA GAT TTA ACT 
Glu Asn Ser Ser Val Ser Phe Trp lie Lys lie Ser Lys Asp Leu i£ 
930 935 9 4 o 

AAT TCT CAT AAT GAA TAT ACA ATA ATT AAC AGT ATA GAA CAA AAT TCT 
Asn Ser His Asn Glu Tyr Thr He lie Asn Ser He Glu Gin Asn Ser 
945 950 955 960 

GGG TGG AAA TTA TGT ATT AGG AAT GGC AAT ATA GAA TGG ATT TTA CAA 
Gly Trp Lys Leu Cys He Arg Asn Gly Asn He Glu Trp ill Lei Gin 
965 970 975 

A« SIT ^ T ^ A *** AGT TTA ATT TTT GAT TAT AGT GAA TCA 

Asp Val Asn Arg Lys Tyr Lys Ser Leu He Phe Asp Tyr Ser Glu Ser 



AAT ATT GTA TAT GAG GGA CAA ATA TTA AGA AAT GTT ATT AAA GAT TAT 

He I 
1085 



A-n T o Zl\ i X? X, , 1 iM AAT GTT ATT AAA GAT TAT 
Asn He Val Tyr Glu Gly Gin lie Leu Arg Asn Val He Lys Asp Tyr 
1075 1080 



GTT CGG TAT CCA GAT AGA TCT AAA TTA TAT ACT GGA AAT CCT ATT ACT 
Val Arg Tyr Pro Asp Arg Ser Lys Leu Tyr Thr Gly Asn Pro He Thr 
1125 H30 

ill I« A I CT GAT ^ AAT CCT TAT AGT AGA ATT ™ AAT GGA 

He Lys Ser Val Ser Asp Lys Asn Pro Tyr Ser Arg He Leu Asn Gly 

1140 1145 U50 

GAT AAT ATA ATT CTT CAT ATG TTA TAT AAT AGT AGG AAA TAT ATG ATA 
Asp Asn lie lie Leu His Met Leu Tyr Asn Ser Arg Lys Tyr Met lie 
1155 1160 H65 

tT^ ^ * M JP GAT ACA ATA TAT GCA ACA CAA GGA GGA GAG TGT TCA 

?f?n P ASP Thr Ue Tyr Ala Thr Gln Gi y Gl 7 G ^ Cys Ser 

1170 1175 1180 



2784 



2832 



2880 



2928 



>976 



3024 



3072 



2() 980 985 

TTA AGT CAT ACA GGA TAT ACA AAT AAA TGG TTT TTT GTT ACT ATA ACT 
Leu Ser His Thr Gly Tyr Thr Asn Lys Trp Phe Phe Val Thr lie Thr 
995 iooo 1005 

25 AAT AAT ATA ATG GGG TAT ATG AAA CTT TAT ATA AAT GGA GAA TTA AAG 

Asn Asn lie Met Gly Tyr Met Lys Leu Tyr He Asn Gly Glu Leu JJS 
±01 ° 1015 1020 

30 Gin ^1 ^ tT T 5t* GAT TTA GAT GAG GTT AAG TTA GAT AAA ACC 3120 

Mi Gin Sei Gin Lys He Glu Asp Leu Asp Glu Val Lys Leu Asp Lys Thr 

10 " 5 1030 1035 10 40 

lit SI? H T tT A GAT GAG AAT ATA GAT GAG AAT CAG ATG CTT TGG 

He Val Phe Gly He Asp Glu Asn He Asp Glu Asn Gin Met Leu Trp 

10/15 1050 1055 

tTI £ GA ^ AT ll r ATT TTT TCT ^ GAA TTA AGT AAT GAA GAT ATT 

He Arg Asp Phe Asn He Phe Ser Lys Glu Leu Ser Asn Glu Asp lie 
1060 1065 1070 



3168 



3216 



3264 



331: 



TGG GGA AAT CCT TTG AAG TTT GAT ACA GAA TAT TAT ATT ATT AAT GAT 

P ?nL L6U LyS Phe Asp Thr Glu T V r Tyr He He Asn Asp 

1090 1095 iioo 

AAT TAT ATA GAT AGG TAT ATT GCA CCT GAA AGT AAT GTA CTT GTA CTT 3360 

t?n, Tyi " 116 ASP Arg Tyr Ue Ala Pro Glu Val Leu Val Lei 

1105 1H5 1120 



3408 



3456 



3504 



355: 



70 ^ I GT GTA TAT GCA TTA ^ TTA CAG AGT AAT TTA GGT AAT TAT 3600 

/0 Gin Asn Cys Val Tyr Ala Leu Lys Leu Gin Ser Asn Leu Gly Asn Tyr 
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3792 



3831 



1185 1190 1195 1200 

GGT ATA GGT ATA TTT ACT ATA AAA AAT ATT GTA TCT AAA AAT AAA TAT 3 64 8 

Gly lie Gly He Phe Ser He Lys Asn He Val Ser Lys Asn Lys Tvr 
1205 1210 1215 

TGT AGT CAA ATT TTC TCT AGT TTT AGG GAA AAT ACA ATG CTT CTA GCA 3 6 96 

Cys Ser Gin He Phe Ser Ser Phe Arg Glu Asn Thr Met Leu L«u Ala 
!220 1225 1230 

GAT ATA TAT AAA CCT TGG AGA TTT TCT TTT AAA AAT GCA TAC ACG CCA 3744 
Asp He Tyr Lys Pro Trp Arg Phe Ser Phe Lys Aon Ala Tyr Thr Pro 
1235 1240 1245 

GTT GCA GTA ACT AAT TAT GAA ACA AAA CTA TTA TCA ACT TCA TCT TTT 
Val Ala Val Thr Asn Tyr Glu Thr Lys Leu Leu Ser Thr Ser Ser Phe 
12 50 1255 126O 

TGG AAA TTT ATT TCT AGG GAT CCA GGA TGG GTA GAG TAA 
Trp Lys Phe He Ser Arg Asp Pro Gly Trp Val Glu 
1265 1270 1275 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1276 amino acids 

(B) TYPE: amino acid 
(Di TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(:<i) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Met Thr Trp Pro Val Lys Asp Phe Asn Tyr Ser Asp Pro Val Asn Asp 
1 5 10 15 

Asn Asp He Leu Tyr Leu Arg lie Pro Gin Asn Lys Leu He Thr Thr 
20 25 30 

Pro Val Lys Ala Phe Met lie Thr Gin Asn He Trp Val He Pro Glu 
35 40 45 

Arg Phe Ser Ser Asp Thr Asn Pro Ser Leu Ser Lys Pro Pro Arq Pro 
3 ° 55 60 

Thr ser Lys Tyr Gin Ser Tyr Tyr Asp Pro Ser Tvr Leu Ser Thr Asp 
65 7 ° 75 ' 80 

Glu Gin Lys Asp Thr Phe Leu Lys Gly He He Lys Leu Phe Lys Arq 
85 90 95 

He Asn Glu Arg Asp He Gly Lys Lys Leu He Asn Tyr Leu Val Val 
100 105 no 

Gly Ser Pro Phe Met Gly Asp Ser Ser Thr Pro Glu Asp Thr Phe Asp 
115 120 125 

Phe Thr Arg His Thr Thr Asn lie Ala Val Glu Lys Phe Glu Asn Glv 
I 30 135 140 

Ser Trp Lys Val Thr Asn He He Thr Pro Ser Val Leu lie Phe Glv 
145 i5 ° 155 160 

Pro Leu Pro Asn He Leu Asp Tyr Thr Ala Ser Leu Thr Leu Gin Gly 
165 170 175 

Gin Gin Ser Asn Pro Ser Phe Glu Gly Phe Gly Thr Leu Ser He Leu 
180 185 190 

Lys Val Ala Pro Glu Phe Leu Leu Thr Phe Ser Asp Val Thr Ser Asn 
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195 200 



205 



10 



15 



20 



Gin Ser Ser Ala Val Leu Gly Lys Ser lie Phe Cys Met Asp Pro Val 
10 215 220 

lie Ala Leu Met His Glu Leu Thr His Ser Leu His Gin Leu Tvr Glv 
~ S 230 235 ' 240 

He Asn lie Pro Ser Asp Lys Arg lie Arg Pro Gin Val Ser Glu Gly 
245 250 255 

Phe Phe Ser Gin Asp Gly Pro Asn Val Gin Phe Glu Glu Leu Tyr Thr 
260 265 270 

Phe Gly Gly Leu Asp Val Glu lie lie Pro Gin lie Glu Arg Ser Gin 
* 75 2B0 285 

Leu Arg Glu Lys Ala Leu Gly His Tyr Lys Asp lie Ala Lys Arg Leu 
- au 295 300 

Asn Asn lie Asn Lys Thr lie Pro Ser Ser Trp He Ser Asn lie Asp 
J05 310 315 320 

Lys Tyr Lys Lys lie Phe Ser Glu Lys Tyr Asn Phe Asp Lys Asp Asn 
325 330 335 

Thr Gly Asn Phe Val Val Asn lie Asp Lys Phe Asn Ser Leu Tvr ser 
340 345 350 ' 

Asp Lou Thr Asn Val Met Ser Glu Val Val Tyr Ser Ser Gin Tvr Asn 
/ 355 360 3 6 5 

Val Lys Asn Arg Thr His Tyr Phe Ser Arg His Tyr Leu Pro Val Phe 
55 J 0 375 3 8 o 

Ala Asn lie Leu Asp Asp Asn lie Tyr Thr lie Arg Asp Gly Phe Asn 
385 390 395 400 

Leu Thr Asn Lys Gly Phe Asn He Glu Asn Ser Gly Gin Asn He Glu 

1105 410 415 

Arg Asn Pro Ala Leu Gin Lys Leu Ser Ser Glu Ser Val Val Asp Leu 
120 425 430 

Phe Thr Lys Val Cys Leu Arg Leu Thr Lys Asn Ser Arg Asp Asp Ser 
4 35 440 445 

Thr Cys He Lys Val Lys Asn Asn Arg Leu Pro Tyr Val Ala Asp Lys 
5() 450 455 460 X 

Asp Ser He Ser Gin Glu He Phe Glu Asn Lys lie He Thr Asp Glu 
465 470 47S * 480 

Thr Asn Val Gin Asn Tyr Ser Asp Asn Phe Ser Leu Asp Glu Ser He 
4 85 4 9 o 495 

Leu Asp Gly Gin Val Pro He Asn Pro Glu He Val Asp Pro Leu Leu 
500 505 510 

Pro Asn Val Asn Met Glu Pro Leu Asn Leu Pro Gly Glu Glu He Val 
515 520 525 

Phe Tyr Asp Asp He Thr Lys Tyr Val Asp Tyr Leu Asn Ser Tvr Tyr 
530 535 54Q - 

Tyr Leu Glu Ser Gin Lys Leu Ser Asn Asn Val Glu Asn He Thr Leu 

550 5 5 5 56Q 

Thr Thr Ser Val Glu Glu Ala Leu Gly Tyr Ser Asn Lvs He Tyr Thr 
56S 5 7o ' 57s 



40 



45 



60 



65 



70 
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Phe Leu Pro Ser Leu Ala Glu Lys Val Asn Lys Gly Val Gin Ala Gly 



Leu Phe Leu Asn Trp Ala Asn Glu Val Val Glu Asp Phe Thr Thr Asn 

600 g 05 

He Met Lys Lys Asp Thr Leu Asp Lys He Ser Asp Val Ser Val He 

615 620 
lie Pro Tyr He Gly Pro Ala Leu Asn He Gly Asn Ser Ala Leu Arg 

635 640 
Gly Asn Phe Lys Gin Ala Phe Ala Thr Ala Gly Val Ala Phe Leu Leu 

645 650 6S5 

Glu Gly Phe Pro Glu Phe Thr He Pro Ala Leu Gly Val Phe Thr Phe 



665 



670 



Tyr Ser Ser He Gin Glu Arg Glu Lys lie Ue Lys Thr He Glu Asn 

680 6 g 5 

Cys Leu Glu Gin Arg Val Lys Arg Trp Lys Asp Ser Tyr Gin Trp Met 

695 700 

Val ser Asn Trp Leu Ser Arg He Thr Thr Gin Phe Asn His He Asn 

710 7 15 720 

Tyr Gin Met Tyr Asp Ser Leu Ser Tyr Gin Ala Asp Ala lie Lys Ala 

730 



735 



Lys He Asp Leu Glu Tyr Lys Lys Tyr Ser Gly Ser Asp Lys Glu Asn 

He Lys Ser Gin Val Glu Asn Leu Lys Asn Ser Leu Asp Val L ys He 

760 

Ser Glu Ala Met Asn Asn He Asn Lys Phe He Arg Glu Cys Ser Val 



780 



Thr Tyr Leu Phe Lys Asn Met Leu Pro Lys Val He Asp Glu Leu Asn 

790 7 95 800 

Lys Phe Asp Leu Arg Thr Lys Thr Glu Leu He Asn Leu lie Asp Ser 

805 810 



815 



His Asn lie lie Leu Val Gly Glu Val Asp Arg Leu Lys Ala Lys Val 

825 830 

Asn Glu Ser Phe Glu Asn Thr Met Pro Phe Asn He Phe Ser Tyr Thr 
835 840 845 > 

Asn Asn Ser Leu Leu Lys Asp He He Asn Glu Tyr Phe Asn Ser He 

855 B60 

Asn Asp ser Lys lie Leu Ser Leu Gin Asn Lys Lys Asn Ala Leu Val 

870 875 880 

Asp Thr Ser Gly Tyr Asn Ala Glu Val Arg Val Gly Asp Asn Val Gin 
885 890 B95 

Leu Asn Thr lie Tyr Thr Asn Asp Phe Lys Leu Ser Ser Ser Gly Asp 
900 905 910 P 

Lys lie lie Val Asn Leu Asn Asn Asn He Leu Tyr Ser Ala He Tyr 

920 925 

Glu Asn ser Ser Val Ser Phe Trp Ue Lys He Ser Lys Asp Leu Thr 

935 

Asn Ser His Asn Glu Tyr Thr He lie Asn Ser He Glu Gin Asn Ser 
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945 950 955 960 

Gly Trp Lys Leu Cys lie Arg Asn Gly Asn He Glu Trp lie Leu Gin 
5 9 *5 970 975 

Asp Val Asn Arg Lys Tyr Lys Ser Leu He Phe Asp Tyr Ser Glu Ser 
980 985 990 

Leu Ser His Thr Gly Tyr Thr Asn Lys Trp Phe Phe Val Thr He Thr 
1U 995 1000 10 05 

Asn Asn lie Met Gly Tyr Met Lys Leu Tyr lie Asn Gly Glu Leu Lys 
1010 1015 102 0 7 

?i?c; Ser Gln LyS Ile Glu Asp Leu As P Glu Val L Y* Leu Asp Lys Thr 
102b 1030 1035 1040 

He Val Phe Gly lie Asp Glu Asn Ile Asp Glu Asn Gin Met Leu Trp 
20 1045 1050 1055 

Ile Arg Asp Phe Asn Ile Phe Ser Lys Glu Leu Ser Asn Glu Asp Ile 
1060 1065 107 0 

Asn Ile Val Tyr Glu Gly Gin Ile Leu Arg Asn Val Ile Lys Asp Tyr 
1075 1080 108 5 

TrP ?nL ASn Pr ° LSU LyS Phe Asp Thr Glu T ^ r T Y r n e He Asn Asp 
1090 1095 iioo 

M) tfn J yr Ue ASP Al " g Tyr Ile Ala Pro Glu Ser Asn Val Leu val Leu 
1105 1110 H15 U20 

Val Arg Tyr Pro Asp Arg Ser Lys Leu Tyr Thr Gly Asn Pro Ile Thr 
35 H25 1130 1135 

Ile Lys Ser Val Ser Asp Lys Asn Pro Tyr Ser Arg lie Leu Asn Glv 
H40 H45 use 

Asp Asn lie Ile Leu His Met Leu Tyr Asn Ser Arg Lys Tyr Met Ile 
1155 H60 ii 65 

Ile Arg Asp Thr Asp Thr Ile Tyr Ala Thr Gin Gly Gly Glu Cys Ser 
1170 1175 1180 

4> Gin Asn Cys Val Tyr Ala Leu Lys Leu Gin Ser Asn Leu Gly Asn Tvr 

85 1190 H95 1200 

Gly lie Gly He Phe Ser Ile Lys Asn Ile Val Ser Lys Asn Lys Tyr 
5() 1205 12 1° 1215 

Cys Ser Gin Ile Phe Ser Ser Phe Arg Glu Asn Thr Met Leu Leu Ala 
1220 1225 1230 

Asp Ile Tyr Lys Pro Trp Arg Phe Ser Phe Lys Asn Ala Tyr Thr Pro 
1235 1240 1245 

Val Ala Val Thr Asn Tyr Glu Thr Lys Leu Leu Ser Thr Ser Ser Phe 
1250 1255 126O 

60 Trp Lys Phe He Ser Arg Asp Pro Gly Trp Val Glu 

1265 1270 1275 
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(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1469 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 108.. 1460 

fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 
TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT 



Met Gly His 
1 



CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT 
His His H iS His His His His His His Ser Ser Gly Hil lie 35 oTy 



IS 



CGT CAT ATG GCT AGC ATG GCT TTA TTA AAA GAT ATA ATT AAT GAA TAT 
Arg Hxs Met Ala Ser Met Ala Leu Leu Lys Asp lie Ue HI ™ 
30 *" 25 30 3 5 



TTC AAT AGT ATT AAT GAT TCA AAA ATT TTG AGC TTA CAA AAC AAA AAA 
Phe Asn ser lie Asn Asp Ser Lys lie Leu Ser LeS G^n Aon Lys Lys 
40 IS 50 

AAT GCT TTA GTG GAT ACA TCA GGA TAT AAT GCA GAA GTG AGG GTA GGA 
Asn Ala Leu Val Asp Thr Ser Gly Tyr Asn Ala Glu Val Arg v™ Gi? 

55 cn „ Z 1 



Tvl of I CA T A AGT CAT ACA GGA TAT ACA ™ T AAA TGG TTT TTT 

65 V i« LeU SSr HlS Thr G1 V T V r Th r Asn Lys Tip Phe Phe 

170 175 

GTT ACT ATA ACT AAT AAT ATA ATG GGG TAT ATG AAA CTT TAT ATA AAT 
val Thr lie Thr Asn Asn lie Met Gly Tyr Met Lys Leu IvI lie tl 
70 185 190 ' 195 



60 
116 

164 



260 



308 



55 60 

40 GAT f* 1 " GT T CAA CTT ACG ATA TAT ACA AAT GAC TTT AAA TTA AGT 

40 Asp Asn Val Gin Leu Asn Thr He Tyr Thr Asn Asp Phe Lys Uu sir 

75 80 

AGT TCA GGA GAT AAA ATT ATA GTA AAT TTA AAT AAT AAT ATT TTA TAT 
Ser Ser Gly Asp Lys lie He Val Asn Leu Asn Asn AsJ lie Leu lyl 
B5 90 95 ' 

AGC GCT ATT TAT GAG AAC TCT AGT GTT AGT TTT TGG ATT AAG ATA TCT 
Ser Ala He Tyr Glu Asn Ser Ser Val Ser Phe Tr£ 111 ^s lie III 
50 105 110 us 

AAA GAT TTA ACT AAT TCT CAT AAT GAA TAT ACA ATA ATT AAC AGT ATA 
Lys Asp Leu Thr Asn Ser His Asn Glu Tyr Thr lie i" s er lie 

120 125 l3() 

55 GAA CAA ^ I CT G f I GG *** TTA TGT ATT AGG AAT GGC AAT ATA GAA 

Glu Gin Asn Ser Gly Trp Lys Leu Cys lie Arg Asn Gly Asn lie Glu 
135 140 145 

60 l GG tT T TTA CAA GAT GTT AGA AAG TAT AAA AGT TTA ATT TTT GAT ,o 6 

60 Trp lie Leu Gin Asp Val Asn Arg Lys Tyr Lys Ser Leu lie III 2J 

U 155 160 



404 



452 



500 



548 



644 



692 
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GGA GAA TTA AAG CAG ACT CAA AAA ATT GAA GAT TTA GAT GAG GTT AAG 
»ly Glu Leu Lys Gin Ser Gin Lys lie Glu Asp Leu Asp Glu val Lys 
200 205 210 

TTA GAT AAA ACC ATA GTA TTT GGA ATA GAT GAG AAT ATA GAT GAG AAT 
LeU ASP L * s ™r lie Val Phe Gly He Asp Glu Asn lie tsp Su Itn 
* 15 220 225 

CAG ATG CTT TGG ATT AGA GAT TTT AAT ATT TTT TCT AAA GAA TTA ACT 
Cln Met Leu Trp lie Arg Asp Phe Asn He Phe Ser Lys G^u Leu £1 
230 2 35 240 

AAT GAA GAT ATT AAT ATT GTA TAT GAG GGA CAA ATA TTA AGA AAT CTT 
Asn Glu Asp He Asn lie Val Tyr Glu Gly Gin lie LTu £g £1 Val 

250 255 

ATT AAA GAT TAT TGG GGA AAT CCT TTG AAG TTT GAT ACA GAA TAT TAT 
lie uys Asp Tyr Trp Gly Asn Pro Leu Lys Phe Asp t2 SI Tyr ™ 



270 



275 



111 n ^ n AT AAT TAT ATA GAT AGG TAT AT T GCA CCT GAA AGT AAT 
He lie Asn Asp Asn Tyr lie Asp Arg Tyr He Ala Pro Glu Ser A^n 
280 28S 290 

GTA CTT GTA CTT GTT CGG TAT CCA GAT AGA TCT AAA TTA TAT ACT GGA 
■al Leu Val Leu Val Arg Tyr Pro Asp Arg Ser L^s llu Tyr TnT Gly 
295 300 305 



A^n £ro ill Th T tT T ?** T A GTA TCT GAT ^ *** CCT TAT A GT AGA 
Aon Ho lie Thx lie Lys Ser Val Ser Asp Lys Asn Pro Tyr Ser Arg 

315 320 



He" Leu A^n r?v tT A A T T CTT CAT ATG TTA TAT ™T AGT A GG 

lie Leu Asn Gly Asp Asn lie lie Leu His Met Leu Tyr Asn Ser Arc 

330 335 

iJl Itl Mir tT A rT A AGA GAT ACT GAT ACA ATA TAT GCA ACA CAA GGA 
L/s T yi Met He He Arg Asp Thr Asp Thr He Tyr Ala Thr Gin Gly 

345 350 355 

Gly eta rv? If A CAA AAT J" GTA TAT GCA TTA TTA CA ° ACT AAT 

-ly Glu cys Ser Gin Asn Cys Val Tyr Ala Leu Lys Leu Gin Ser Asn 

360 365 370 

TTA GGT AAT TAT GGT ATA GGT ATA TTT AGT ATA AAA AAT ATT fiTA Trr 
Leu Gly A,n Tyr Gly He Gly H e Phe Ser He L^ ^n ill SJ Ser 
375 380 3 85 

^ ™ T TGT AGT CAA ATT TTC TCT AGT TTT AGG GAA AAT ACA 

.ys Asn Lys Tyr Cys Ser Gin He Phe Ser Ser Phe Arg Glu Asn Thr 

395 400 

ATG CTT CTA GCA GAT ATA TAT AAA CCT TGG AGA TTT TCT TTT AAA AAT 
Mec Leu Leu Ala Asp He Tyr Lys Pro Trp Arg Phe Ser Phe 

£ AC £S G CCA GTT GCA GTA ACT ^ TAT GAA ACA AAA CTA TTA TCA 
Ala Tyr Thr Pro Val Ala Val Thr Asn Tyr Glu Thr Lys Leu Leu Ser 
425 430 435 

ACT TCA TCT TTT TGG AAA TTT ATT TCT AGG GAT CCA GGA TGG GTA GAG 
4hr Ser S " Phe Tr P ^ Phe lie Ser Arg Asp Pro Gly Trp vll Glu 
440 445 450 

TAAAAGCTT 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 amino acids 



740 



788 



836 



884 



932 



980 



1028 



1076 



1124 



1172 



1220 



1268 



1316 



1364 



1412 



1460 



1469 
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(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 
Met Gly His His His His His His His His Hi S His Ser Ser Gly His 
10 5 10 15 

lie Glu Gly Arg His Met Ala Ser Met Ala Leu Leu Lys Asp lie lie 

^ 2S 30 

Asn Clu Tyr Phe Asn Ser lie Asn Asp Ser Lys lie Leu Ser Leu Gin 



15 



20 



50 



60 



65 



45 

Asn Lys Lys Asn Ala Leu Val A sp Thr Ser Gly Tyr Asn Ala Glu val 



60 

Arg Val Gly Asp Asn Val Gin Leu Asn Thr lie Tyr Thr Asn Asp Phe 



80 



Lys Leu ser Ser Ser Gly Asp Lys lie He Val Asn Leu Asn Asn Asn 

25 85 90 95 

lie Leu Tyr Ser Ala lie Tyr Glu Asn Ser Ser Val Ser Phe Trp lie 
iuu 105 



110 



10 ^ i?c LyS ASP LeU Thr Asn Ser Hi * Asn Glu Tyr Thr lie He 

120 12 5 



Asn Ser lie Glu Gin Asn Ser Gly Trp Lys Leu cys lie Arg Asn Gly 

135 140 
Asn He Glu Trp He Leu Gin Asp Val Asn Arg Lys Tyr Lys Ser Leu 



150 155 



160 



He Phe Asp Tyr Ser Glu Ser Leu Ser His Thr Gly Tyr Thr Asn Lys 
40 170 175 

Trp Phe Phe yal Thr He Thr Asn Asn He Met Gly Tyr Met Lys Leu 
180 185 190 

45 ^ J« Gly G1U LeU LyS Ser Gln 1^ Glu Asp Leu Asp 

Glu Val Lys Leu Asp Lys Thr He Val Phe Gly He Asp Glu Asn Ilo 



220 



Asp Glu Asn Gin Met Leu Trp lie Arg Asp Phe Asn He Phe s-r Lys 

230 235 2 4 0 

Glu Leu Ser Asn Glu Asp He Asn He Val Tyr Glu Gly Gin He Leu 

55 245 250 255 

Arg Asn Val lie Lys Asp Tyr Trp Gly Asn Pro Leu Lys Phe Asp Thr 

260 265 270 

Glu Tyr Tyr He He Asn Asp Asn Tyr He Asp Arg Tyr He Ala Pro 

275 280 285 

Glu Ser Asn Val Leu Val Leu Val Arg Tyr Pro Asp Arg Ser Lys Leu 

Tyr Thr Gly Asn Pro lie Thr He Lys Ser Val Ser Asp Lys Asn Pro 

310 315 3 20 

Tyr Ser Arg He Leu Asn Gly Asp Asn He He Leu His Met Leu Tyr 

70 35 330 335 
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10 



40 



60 



Asn Ser Arg Lys Tyr Met lie lie Arg Asp Thr Asp Thr lie Tyr Ala 
340 345 35o 

Thr Gin Gly Gly Glu Cys Ser Gin Asn Cys Val Tyr Ala Leu Lys Leu 
355 360 365 

Gin Ser Asn Leu Gly Asn Tyr Gly He Gly He Phe Ser He Lys Asn 
370 375 380 

lie Val ser Lys Asn Lys Tyr Cys Ser Gin lie Phe Ser Ser Phe Arq 
385 390 395 4 oo 

Glu Asn Thr Met Leu Leu Ala Asp lie Tyr Lys Pro Trp Arg Phe Ser 
405 410 415 

Phe Lys Asn Ala Tyr Thr Pro Val Ala Val Thr Asn Tyr Glu Thr Lvs 
420 425 430 

Leu Leu Ser Thr Ser Ser Phe Trp Lys Phe lie Ser Arg Asp Pro Gly 
435 440 445 r r 

Trp Val Glu 
450 

(2) INFORMATION FOR SEQ ID NO : 6 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: other nucleic acid 
CA) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:69: 

GCAAGCTTTT ACTCTACCCA TCCTGGATCC CT 

(2) INFORMATION FOR SEQ ID NO: 70: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3825 base pairs 
<B) TYPE: nucleic acid 
4 - (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1 . . 3822 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 70: 

ATG CCA GTT GCA ATA AAT AGT TTT AAT TAT AAT GAC CCT GTT AAT GAT 

Met Pro Val Ala He Asn Ser Phe Asn Tyr Asn Asp Pro Vai Asp 

5 10 15 

A^n Th* tT T 7™ l AC ATG CAG " A CCA ™ T GAA GAA AAA AGT AAA AAA 

Asp Thr lie Leu Tyr Met Gin lie Pro Tyr Glu Glu Lvs Ser Lys Lys 

20 25 30 



70 



AGA AAT ACA ATA GGA ACG AAT CCT AGT GAT TTT GAT CCA CCG GCT TCA 
Arg Asn Thr He Gly Thr Asn Pro Ser Asp Phe Asp Pro Pro Ala Ser 
50 5 5 60 



32 



48 



ye 



ftS l AT l hT GCT TTT GAG ATT ATG CGT GTT TGG ATA ATT CCT GAG 144 

63 Tyr Tyr Lys Ala Phe Glu lie Met Arg Asn Val Trp He lie Pro Glu 

35 40 45 



192 
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10 



20 



30 



40 



50 



60 



65 



70 



TTA AAG AAC GGA AGC AGT GCT TAT TAT GAT CCT AAT TAT TTA ACC ACT 
Leu Lys Asn Gly Ser Ser Ala Tyr Tyr Asp Pro Asn Tyr Leu Thr Thr 

70 75 6o 

GAT GCT GAA AAA GAT AGA TAT TTA AAA ACA ACG ATA AAA TTA TTT AAG 
Asp Ala Glu Lys Asp Arg Tyr Leu Lys Thr Thr He Lys Leu Phe Lys 
85 90 95 

Aro nl ^ c GT ^ T CCT GCA GGG *** GTT TTG TTA CAA GAA ATA TCA 336 
Arg He Asn Ser Asn Pro Ala Gly Lys Val Leu Leu Gin Glu lie Ser 
100 105 110 



Gly His Asn Ser Ser Thr Glu Ser Phe lie Ala Asp Pro Ala He Ser 
210 215 



220 



240 



288 



384 



432 



TAT GCT AAA CCA TAT TTA GGA AAT GAC CAC ACG CCA ATT GAT GAA TTC 
Tyr Ala Lys Pro Tyr Leu Gly Asn Asp His Thr Pro lie Asp Glu Phe 

120 125 

TCT CCA GTT ACT AGA ACT ACA AGT GTT AAT ATA AAA TTA TCA ACT AAT 
Ser Pro Val Thr Arg Thr Thr Ser Val Asn lie Lys Leu Ser Thr £n 

135 140 

v T T ^ c GT T A ATG TTA TTG *** CTT CTT GTA TTG GGA GCA GGA CCT 
Val Glu ser Ser Met Leu Leu Asn Leu Leu Val Leu Gly Ala Glv Pro 

b 150 155 * 160 

GAT ATA TTT GAA AGT TGT TGT TAC CCC GTT AGA AAA CTA ATA GAT CCA 
Asp lie Phe Glu Ser Cys Cys Tyr Pro Val Arg Lys Leu lie Asp Pro 
i^ 5 170 I? 5 

GAT GTA GTT TAT GAT CCA AGT AAT TAT GGT TTT GGA TCA ATT AAT ATC 576 
Asp val Val Tyr Asp Pro Ser Asn Tyr Gly Phe Gly Ser lie Asn lie 
180 185 190 

25 Thr 17 l CA o CT l AT G ^ TAT ACT TTT *** GAT ATT AGT ^A 624 

Val Thr Phe Ser Pro Glu Tyr Glu Tyr Thr Phe Asn Asp He Ser Gly 

* ,J 195 200 205 

GGG CAT AAT AGT AGT ACA GAA TCA TTT ATT GCA GAT CCT GCA ATT TCA 6 72 



480 



528 



CTA GCT CAT GAA TTG ATA CAT GCA CTG CAT GGA TTA TAC GGG GCT AGG 72 0 

Leu Ala His Glu Leu lie His Ala Leu His Gly Leu Tyr Gly Ala Arg 
~ 2b 230 235 240 



768 



4-^ GGA GTT ACT TAT GAA GAG ACT ATA GAA GTA AAG CAA GCA CCT CTT ATG 

Gly Val Thr Tyr Glu Glu Thr He Glu Val Lys Gin Ala Pro Leu Met 
245 250 255 

ATA GCC GAA AAA CCC ATA AGG CTA GAA GAA TTT TTA ACC TTT GGA GGT 816 
He Ala Glu Lys Pro He Arg Leu Glu Glu Phe Leu Thr Phe Gly Glv 
260 265 270 

CAG GAT TTA AAT ATT ATT ACT AGT GCT ATG AAG GAA AAA ATA TAT AAC 864 
Gin Asp Leu Asn lie lie Thr Ser Ala Met Lys Glu Lys lie Tyr Asn 
33 275 280 285 

AAT CTT TTA GCT AAC TAT GAA AAA ATA GCT ACT AGA CTT AGT GAA GTT 912 
Asn Leu Leu Ala Asn Tyr Glu Lys He Ala Thr Arg Leu Ser Glu Val 
-90 295 300 



AAT AGT GCT CCT CCT GAA TAT GAT ATT AAT GAA TAT AAA GAT TAT TTT 960 
Asn Ser Ala Pro Pro Glu Tyr Asp He Asn Glu Tyr Lys Asp Tyr Phe 
305 310 315 320 

CAA TGG AAG TAT GGG CTA GAT AAA AAT GCT GAT GGA AGT TAT ACT GTA 1008 
Gin Trp Lys Tyr Gly Leu Asp Lys Asn Ala Asp Gly Ser Tyr Thr Val 
325 330 335 

AAT GAA AAT AAA TTT AAT GAA ATT TAT AAA AAA TTA TAT AGT TTT ACA 1056 
Asn Glu Asn Lys Phe Asn Glu He Tyr Lys Lys Leu Tyr Ser Phe Thr 
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340 345 350 

GAG AGT GAC TTA GCA AAT AAA TTT AAA GTA AAA TGT AGA AAT ACT TAT 
Glu Ser Asp Leu Ala Asn Lys Phe Lys Val Lys Cys Arg Asn Thr Tyr 
^55 360 365 

TTT ATT AAA TAT GAA TTT TTA AAA GTT CCA AAT TTG TTA GAT GAT GAT 
Phe lie Lys Tyr Glu Phe Leu Lys Val Pro Asn Leu Leu Asp Asp Asp 
370 375 380 

ATT TAT ACT GTA TCA GAG GGG TTT AAT ATA GGT AAT TTA GCA GTA AAC 
He Tyr Thr Val Ser Glu Gly Phe Asn lie Gly Asn Leu Ala Val Asn 
385 390 395 4 00 

1^ AAT CGC GGA CAA AGT ATA AAG TTA AAT CCT AAA ATT ATT GAT TCC ATT 

Asn Arg Gly Gin Ser He Lys Leu Asn Pro Lys lie He Asp Ser He 
405 410 415 

CCA GAT AAA GGT CTA GTA GAA AAG ATC GTT AAA TTT TGT AAG AGC GTT 
-U Pro Asp Lys Gly Leu Val Glu Lys He Val Lys Phe Cys Lys Ser Val 

420 425 430 

ATT CCT AGA AAA GGT ACA AAG GCG CCA CCG CGA CTA TGC ATT AGA GTA 
He Pro Arg Lys Gly Thr Lys Ala Pro Pro Arg Leu Cys He Arq Val 
435 440 445 

AAT AAT AGT GAG TTA TTT TTT GTA GCT TCA GAA AGT AGC TAT AAT GAA 
Asn Asn Ser Glu Leu Phe Phe Val Ala Ser Glu Ser Ser Tyr Asn Glu 
M} 450 455 4 60 

AAT GAT ATT AAT ACA CCT AAA GAA ATT GAC GAT ACA ACA AAT CTA AAT 
Asn Asp He Asn Thr Pro Lys Glu lie Asp Asp Thr Thr Asn Leu Asn 
465 470 475 480 

TO AAT AAT TAT AGA AAT AAT TTA GAT GAA GTT ATT TTA GAT TAT AAT AGT 

Asn Asn Tyr Arg Asn Asn Leu Asp Glu Val He Leu Asp Tyr Asn Ser 
485 490 495 

d(\ A ^ A ATA ° CT CAA ATA TCA WT CGA ACA TTA AAT ACA CTT GTA CAA 1536 

Gin Thr He Pro Gin He Ser Asn Arg Thr Leu Asn Thr Leu Val Gin 
5 °° 505 510 



50 



GAC AAT AGT TAT GTG CCA AGA TAT GAT TCT AAT GGA ACA AGT GAA ATA 
Asp Asn Ser Tyr Val Pro Arg Tyr Asp Ser Asn Gly Thr Ser Glu He 
*° 515 520 525 



TGG ATA AGC AAA GTA ATA AGA GAT TTT ACC ACT GAA GCT ACA CAA AAA 
Trp He Ser Lys Val He Arg Asp Phe Thr Thr Glu Ala Thr Gin Lys 
°- 595 600 605 



70 



AGT ACT GTT GAT AAG ATT GCA GAC ATA TCT TTA ATT GTA CCC TAT GTA 
Ser Thr Val Asp Lys He Ala Asp He Ser Leu He Val Pro Tyr Val 
610 615 620 



1104 



1152 



1200 



1248 



1296 



1344 



1392 



1440 



1488 



1584 



GAG GAA TAT GAT GTT GTT GAC TTT AAT GTA TTT TTC TAT TTA CAT GCA 
Glu Glu Tyr Asp Val Val Asp Phe Asn Val Phe Phe Tyr Leu His Ala 
530 535 540 

CAA AAA GTG CCA GAA GGT GAA ACC AAT ATA AGT TTA ACT TCT TCA ATT 
Gin Lys Val Pro Glu Gly Glu Thr Asn He Ser Leu Thr Ser Ser He 
545 550 555 560 

GAT ACA GCA TTA TTA GAA GAA TCC AAA GAT ATA TTT TTT TCT T<"A GAG 
Asp Thr Ala Leu Leu Glu Glu Ser Lys Asp He Phe Phe Ser Ser Glu 
565 570 575 

AM ll T A T° GAT ACT ATC CCT GTA GCA GCA CTA TTT ATA GAT 17 76 

OU Phe lie Asp Thr He Asn Lys Pro Val Asn Ala Ala Leu Phe lie Asp 

5fi 0 585 590 



1632 



1680 



1728 



1824 



1872 
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GGT CTT GCT TTG AAT ATA ATT ATT GAG GCA GAA AAA GGA AAT TTT GAG 
Gly Leu Ala Leu Asn lie lie lie Glu Ala Glu Lys Gly Aan Pne GlS 
5 630 63S 640 

GAG GCA TTT GAA TTA TTA GGA GTG GGT ATT TTA TTA GAA TTT CTC rra 
Glu Ala Phe Glu Leu Leu Gly Val Gly He Leu leu III S3 Pro 

645 650 655 

10 ^ 7" ^ A rT T S CT GTA ATT TTA GTG TTT ACG ATA AAA TCC TAT ATA 

10 Glu Leu Thr lie Pro Val lie Leu Val Phe Thr lie Lys Ser Tyr II 
660 ggs 670 y 

GAT TCA TAT GAG AAT AAA AAT AAA GCA ATT AAA GCA ATA AAT AAT TCA 
]5 Asp Ser Tyr Glu Asn Lys Asn Lys Ala lie Lys Ala lie i£ J£ £J 

680 695 

TTA ATC GAA AGA GAA GCA AAG TGG AAA GAA ATA TAT ACT TGG ATA GTA 
Leu lie Glu Arg Glu Ala Lys Trp Lys Glu He Tyr Ser Trp lie 52 
o'" 695 700 

TCA AAT TGG CTT ACT AGA ATT AAT ACT CAA TTT AAT AAA AGA AAA GAG 
Ser Asn Trp Leu Thr Arg lie Asn Thr Gin Phe Asn L^s A^g Glu 

715 720 

25 rf* l kT £ AG °? T TTA CAA CAA GTA GAT GCA A ™ AAA ACA GCA 

Gin Met Tyr Gin Ala Leu Gin Asn Gin Val Asp Ala He Lys Thr Ala 

725 730 735 

10 n A £ AT l AT TAT ACT TCA GAT GAG AAA AAT AGA CTT ->256 

•><> lie Glu Tyr Lys Tyr Asn Asn Tyr Thr Ser Asp Glu Lys Asn Arg Leu 

/4U 7 45 750 

III GAA T T AAT A T° AAT AAT ATA GAA GAA G AA TTG AAT AAA AAA 
,. Glu sei Glu Tyr Asn He Asn Asn He Glu Glu Glu Leu Asn Lys Lys 

GTT TCT TTA GCA ATG AAA AAT ATA GAA AGA TTT ATG ACA GAA AGT TCT 
Val ser Leu Ala Met Lys Asn lie Glu Arg Phe Met Thr Glu Ser Ser 



20 



40 



45 



60 



780 

ATA TCT TAT TTA ATG AAA TTA ATA AAT GAA GCC AAA GTT GGT AAA TTA 

lie Ser Tyr Leu Met Lys Leu lie Asn Glu Ala Lys Val Gly Lys Leu 

785 790 795 soo 

AAA AAA TAT GAT AAC CAT GTT AAG AGC GAT TTA TTA AAC TAT ATT CTC 

Lys Lys Tyr Asp Asn His Val Lys Ser Asp Leu Leu Asn Tyr lie Leu 

805 810 815 



GTG ACT AGT ACT TTG AAT AGT AGT ATT CCA TTT GAA CTT TCT TCA TAT 

Val Thr Ser Thr Leu Asn Ser Ser lie Pro Phe Glu Leu Ser Ser Tyr 

835 840 845 

ACT AAT GAT AAA ATT CTA ATT ATA TAT TTT AAT AGA TTA TAT AAA AAA 

Thr Asn Asp Lys lie Leu He He Tyr Phe Asn Arg Leu Tyr Lys" Lys 

850 85S 860 

ATT AAA GAT AGT TCT ATT TTA GAT ATG CGA TAT GAA AAT AAT AAA TTT 

Ue Lys Asp ser Ser He Leu Asp Met Arg Tyr Glu Asn Asn Lvs Phe 

Bb:> 870 875 ' 880 



1920 



196 8 



2016 



2064 



2112 



2160 



2208 



2304 



23S2 



2400 



2448 



50 GAC . u A I CA A T C TTA GGA GAG CAG ACA AAT GAA TTA AGT GAT TTG 24 96 

3U Asp H 1S Arg Ser He Leu Gly Glu Gin Thr Asn Glu Leu Ser Asp Leu 
820 825 830 



2544 



2592 



2640 
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4S 



ATA GAT ATC TCT GGA TAT GGT TCA AAT ATA AGC ATT AAT GGA AAC GTA 
lie Asp He ser Gly Tyr Gly Ser Asn He Ser lie Asn Gly Asn Val 
8Q 5 890 895 

TAT ATT TAT TCA ACA AAT AGA AAT CAA TTT GGA ATA TAT AAT AGT AGG 
Tyr lie Tyr Ser Thr Asn Arg Asn Gin Phe Gly lie Tyr Asn Ser Arg 
900 905 gio 



AGA TAT CAA AAT TTT AGT ATT AGT TTC TGG GTA AGG ATT CCT AAA CAC 
*rg Tyr Gin Asn Phe Ser lie Ser Phe Trp Val Arg lie Pro Lys His 

930 935 



940 



TAC AAA CCT ATG AAT CAT AAT CGG GAA TAC ACT ATA ATA AAT TGT ATG 

Tyr Lys Pro Met Asn His Asn Arg Glu Tyr Thr He He Asn Cys Met 

945 950 955 gso 

GGG AAT AAT AAT TCG GGA TGG AAA ATA TCA CTT AGA ACT GTT AGA GAT 

Gly Asn Asn Asn Ser Gly Trp Lys lie Ser Leu Arg Thr Val Arg Asp 

965 970 975 

TGT GAA ATA ATT TGG ACT TTA CAA GAT ACT TCT GGA AAT AAG GAA AAT 

Cys Glu lie lie Trp Thr Leu Gin Asp Thr Ser Gly Asn Lys Glu Asn 

980 9B5 990 



^ GAT GAA ACG TAT GTT GGT ATA AGA TAT TTT AAA GTT TTT AAT ACG 
Asp Asp Glu Thr Tyr Val Gly lie Arg Tyr Phe Lvs Val Phe Asn Thv 
1060 1065 ' 1070 



1100 



2668 



2736 



CTT AGT GAA GTT AAT ATA GCT CAA AAT AAT GAT ATT ATA TAC AAT AGT 2784 
10 Leu Ser Glu Val Asn lie Ala Gin Asn Asn Asp lie lie Tvr Asn Ser 
915 920 925 * 



2832 



2880 



292B 



2976 



>() i'I* rV H iAT GAA GAA CTT AGG ATA TCT TAT ATA AAT 3024 

Leu Iie phe A *g Tyr Glu Glu Leu Asn Arg lie Ser Asn Tyr lie Asn 
"5 1000 1005 

AAA TGG ATT TTT GTA ACT ATT ACT AAT AAT AGA TTA GGC AAT TCT AGA 

n o 75 ?n?n Ile ^ Val Thr Ile Thr Asn Asn Al "9 Leu Gly Asn Ser Arq 

1010 1015 1020 

ATT TAC ATC AAT GGA AAT TTA ATA GTT GAA AAA TCA ATT ^CG AAT TTA 
lie Tyr lie Asn Gly Asn Leu He Val Glu Lys Ser Ile Ser Asn Leu 
4() 10 ~ 5 1030 1035 i 0 40 

GGT GAT ATT CAT GTT AGT GAT AAT ATA TTA TTT AAA ATT GTT GGT TGT 
Gly Asp He His Val Ser Asp Asn He Leu Phe Lys He Val Glv Cys 
1Q 45 1050 1055 



3072 



3120 



3168 



3216 



SO ^ 7 GAT ACA GAA ATT GAG ACT TTA TAT AGT AAT GAG CCA GAT 3 26 4 

M «lu Leu Asp Lys Thr Glu He Glu Thr Leu Tyr Ser Asn Glu Pro Asp 

1075 1080 1085 

Pro col tT C 7™ ^ ^ C TAT TGG GGA TAT TTG CTA TAT AAT AAA 

53 ?090 1095^ ^ ^ LS -- LeU Tyr Asn LyS 



3312 



AAA TAT TAT TTA TTC AAT TTA CTA AGA AAA GAT AAG TAT ATT ACT CTG 
Lys Tyr Tyr Leu Phe Asn Leu Leu Arg Lys Asp Lys Tyr He Thr Leu 
60 1110 HIS 1120 

AAT TCA GGC ATT TTA AAT ATT AAT CAA CAA AGA GGT GTT ACT GAA GGC 
Asn Ser Gly He Leu Asn lie Asn Gin Gin Arg Gly Val Thr Glu Glv 
x 125 ii30 

65 TCT GTT TTT TTG AAC TAT AAA TTA TAT GAA GGA GTA GAA GTC ATT ATA 

Ser Val Phe Leu Asn Tyr Lys Leu Tyr Glu Gly Val Glu Val He Ile 
H40 1145 use 

70 a GA MT GGT CCT ATA GAT ATA TCT AAT ACA GAT AAT TTT GTT AGA 3 5 04 

70 Arg Lys Asn Gly Pro He Asp He Ser Asn Thr Asp Asn Phe Val Arg 

- 359 - 



3360 



3408 



3456 
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1155 1160 



1165 



_ „ *" n ™ n AUT AAT GGA TGC TTT Tnn apt 

Tyr As^Asn lie Arg Arg Asn^hr Ser Ser Asn Gly Cys Phe Jrp Ser 



AAA AAC GAT CTA GCA TAC ATT AAT GTA GTA GAT CGT GGT GTA GAA TAT 
Lys Asn Asp Leu Ala Tyr lie Asn Val Val Asp Arg Gly Val Glu lyl 

1175 H80 

CGG TTA TAT GCT GAT ACA AAA TCA GAG AAA GAG AAA ATA ATA AGA ACA 
Arg Leu Tyr Ala Asp Thr Lys Ser Glu Lys Glu Lys lie lie Arg Thr 

1190 H95 1200 

TCT AAT CTA AAC GAT AGC TTA GGT CAA ATT ATA GTT ATG GAT TCA ATA 
Ser Asn Leu Asn Asp Ser Leu Gly Gin He He Val Met Asp Ser lie 
I 205 1210 x2is 

G?v ^1 f T ^ ATG ^ TTT CAA ^ AAT AAT GGG AGC AAT ATA 

Gly Asn Asn Cys Thr Met Asn Phe Gin Asn Asn Asn Gly Ser Asn lie 
1220 1225 1230 

GGA TTA CTA GGT TTT CAT TCA AAT AAT TTG GTT GCT AGT AGT TGC TAT 
Gly Leu Leu Gly Phe His Ser Asn Asn Leu Val All Ser Ser Tr P ™ 
!* J5 1240 12 45 

?vr ^ £1 ? GA ^ GA ^ T ACT AGC AGT AAT GGA TGC TTT TGG AGT 

Gly ( 
1260 

TCT ATT TCT AAA GAG AAT GGA TGG AAA GAA TGA 
Ser lie Ser Lys Glu Asn Gly Trp Lys Glu 
1265 12 70 

(2) INFORMATION FOR SEQ ID NO: 71: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1274 ammo acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE; protein 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 71: 

Met Pro Val Ala lie Asn Ser Phe Asn Tyr Asn Asp Pro Val Asn Asp 
1 S 10 is 

Asp Thr lie Leu Tyr Met Gin lie Pro Tyr Glu Glu Lvs Ser Lvs Lys 
20 25 ' 3C * 

Tyr Tyr Lys Ala Phe Glu He Met Arg Asn Val Trp lie lie Pro Glu 
35 40 45 

Arg Asn Thr lie Gly Thr Asn Pro Ser Asp Phe Asp Pro Pro Ala Ser 
bU 55 60 

Leu Lys Asn Gly Ser Ser Ala Tyr Tyr Asp Pro Asn Tyr Leu Thr Thr 

70 75 so 

Asp Ala Glu Lys Asp Arg Tyr Leu Lys Thr Thr lie Lys Leu Phe Lvs 

85 90 95 

Arg He Asn Ser Asn Pro Ala Gly Lys Val Leu Leu Gin Glu lie Ser 
100 105 110 

Tyr Ala Lys Pro Tyr Leu Gly Asn Asp His Thr Pro lie Asp Glu Phe 
lib 120 125 

Ser Pro Val Thr Arg Thr Thr Ser Val Asn lie Lys Leu Ser Thr Asn 
iJU 135 140 

Val Glu Ser Ser Met Leu Leu Asn Leu Leu Val Leu Gly Ala Gly Pro 

150 155 160 



3552 



3600 



3648 



3696 



3744 



3792 



3825 



- 360 - 



Asp lie Phe Glu Ser Cys Cys Tyr Pro Val Arg Lys Leu lie Asp Pro 
165 170 175 

Asp Val Val Tyr Asp Pro Ser Asn Tyr Gly Phe Glv Ser lie Asn He 
180 185 * 190 

Val Thr Phe Ser Pro Glu Tyr Glu Tyr Thr Phe Asn Asp lie Ser Glv 
195 200 205 

Gly His Asn Ser Ser Thr Glu Ser Phe lie Ala Asp Pro Ala lie Ser 
** 10 215 220 

Leu Ala His Glu Leu He His Ala Leu His Gly Leu Tyr Gly Ala Arg 

230 235 240 

Gly Val Thr Tyr Glu Glu Thr lie Glu Val Lys Gin Ala Pro Leu Met 
245 250 255 

He Ala Glu Lys Pro lie Arg Leu Glu Glu Phe Leu Thr Phe Gly Glv 
260 265 270 

Gin Asp Leu Asn lie lie Thr Ser Ala Met Lys Glu Lys lie Tyr Asn 
-75 280 285 

Asn Leu Leu Ala Asn Tyr Glu Lyc He Ala Thr Arg Leu Ser Glu Val 
-90 2 95 



300 



Asn Ser Ala Pro Pro Glu Tyr Asp lie Asn Glu Tyr Lys Asp Tyr Phe 

310 315 320 

Gin Trp Lys Tyr Gly Leu Asp Lys Asn Ala Asp Glv Ser Tyr Thr Val 
325 330 ' 335 

Asn Glu Asn Lys Phe Asn Glu He Tyr Lys Lys Leu Tyr Ser Phe Thr 
340 345 350 

Glu Ser Asp Leu Ala Asn Lys Phe Lys Val Lys Cys Arg Asn Thr Tvr 
355 360 365 

Phe lie Lys Tyr Glu Phe Leu Lys Val Pro Asn Leu Leu Asp Asp Asp 
370 375 380 

He Tyr Thr Val Ser Glu Gly Phe Asn He Gly Asn Leu Ala Val Asn 
385 390 395 400 

Asn Arg Gly Gin Ser lie Lys Leu Asn Pro Lys He He Asp Ser He 
405 410 415 

Pro Asp Lys Gly Leu Val Glu Lys He Val Lys Phe Cys Lys Ser Val 
420 425 430 

He Pro Arg Lys Gly Thr Lys Ala Pro Pro Arg Leu Cys He Arq Val 
435 440 445 * 

Asn Asn Ser Glu Leu Phe Phe Val Ala Ser Glu Ser Ser Tyr Asn Glu 
450 455 1 



460 



Asn Asp He Asn Thr Pro Lys Glu He Asp Asp Thr Thr Asn Leu Asn 

465 470 475 480 

Asn Asn Tyr Arg Asn Asn Leu Asp Glu Val He Leu Asp Tyr Asn Ser 

485 490 495 

Gin Thr He Pro Gin lie Ser Asn Arg Thr Leu Asn Thr Leu Val Gin 
500 505 sio 

Asp Asn Ser Tyr Val Pro Arg Tyr Asp Ser Asn Gly Thr Ser Glu He 
515 520 525 

Glu Glu Tyr Asp Val Val Asp Phe Asn Val Phe Phe Tyr Leu His Ala 



- 361 - 



"0 535 



54 0 



Gin Lys Val Pro Glu Gly Glu Thr Asn lie Ser Leu Thr Ser Ser lie 

550 555 560 

Asp Thr Ala Leu Leu Glu Glu Ser Lys Asp lie Phe Phe Ser Ser Glu 
565 570 575 

Phe lie Asp Thr lie Asn Lys Pro Val Asn Ala Ala Leu Phe lie Asp 
580 585 5 9o * 

Trp He Ser Lys Val He Arg Asp Phe Thr Thr Glu Ala Thr Gin Lys 
595 600 G05 

Ser Thr Val Asp Lys He Ala Asp lie Ser Leu lie Val Pro Tyr Val 

615 620 

Gly Leu Ala Leu Asn lie lie lie Glu Ala Glu Lys Gly Asn Phe Glu 

630 63S 640 

Glu Ala Phe Glu Leu Leu Gly Val Gly He Leu Leu Glu Phe Val Pro 
645 650 655 

Glu Leu Thr lie Pro Val He Leu Val Phe Thr lie Lys Ser Tyr Ue 
660 66S 670 

Asp ser Tyr Glu Asn Lys Aon Lys Ala He Lys Ala lie Asn Asn Ser 
675 680 665 

Leu lie Glu Arg Glu Ala Lys Trp Lys Glu lie Tyr Ser Trp He Val 



700 



Ser Asn Trp Leu Thr Arg He Asn Thr Gin Phe Asn Lys Arg Lys Glu 

710 715 720 

Gin Met Tyr Gin Ala Leu Gin Asn Gin Val Asp Ala He Lys Thr Ala 
725 730 735 

He Glu Tyr Lys Tyr Asn Asn Tyr Thr Ser Asp Glu Lys Asn Arg Leu 
740 745 750 

Glu Ser Glu Tyr Asn He Asn Asn He Glu Glu Glu Leu Asn Lys Lys 
755 760 76S 

Val Ser Leu Ala Met Lys Asn He Glu Arg Phe Met Thr Glu Ser Ser 

' '0 775 



780 



He Ser Tyr Leu Met Lys Leu He Asn Glu Ala Lys Val Gly Lys Leu 
785 790 795 eoo 

Lys Lys Tyr Asp Asn His Val Lys Ser Asp Leu Leu Asn Tyr He Leu 
805 810 815 

Asp His Arg Ser He Leu Gly Glu Gin Thr Asn Glu Leu Ser Asp Leu 
820 825 830 

Val Thr Ser Thr Leu Asn Ser Ser He Pro Phe Glu Leu Ser Ser Tyr 
835 840 845 

Thr Asn Asp Lys He Leu He He Tyr Phe Asn Arg Leu Tyr Lys Lys 
HbO 855 



860 



He Lys Asp Ser Ser He Leu Asp Met Arg Tyr Glu Asn Asn Lys Phe 
865 870 875 8 80 

He Asp He Ser Gly Tyr Gly Ser Asn He Ser He Asn Gly Asn Val 
885 890 895 

Tyr He Tyr Ser Thr Asn Arg Asn Gin Phe Gly He Tyr Asn Ser Arg 
900 905 910 
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10 



15 
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Leu Ser Glu Val Asn lie Ala Gin Asn Asn Asp He He Tyr Asn Ser 
915 920 925 

Arg Tyr Gin Asn Phe Ser He Ser Phe Trp Val Arg He Pro Lys His 
930 935 940 

Tyr Lys Pro Met Asn His Asn Arg Glu Tyr Thr lie He Asn Cys Met 
945 950 955 960 

Gly Asn Asn Asn Ser Gly Trp Lys He Ser Leu Arg Thr Val Arq Asp 
9 65 970 975 

Cys Glu He lie Trp Thr Leu Gin Asp Thr Ser Gly Asn Lys Glu Asn 
380 985 990 

Leu He Phe Arg Tyr Glu Glu Leu Asn Arg He Ser Asn Tyr lie Asn 
995 1000 10 05 

LyS Tn?n Ile PhS Val Thr Ile Thr Asn Asn Ar< 3 Leu Gly Asn Ser Arg 
1010 1015 xo20 

He Tyr He Asn Gly Asn Leu lie Val Glu Lys Ser He Ser Asn Leu 
102S 1030 1035 1040 

Gly Asp lie His Val Ser Asp Asn lie Leu Phe Lys He Val Gly Cvs 
1045 1050 ' 1055 

Asp Asp Glu Thr Tyr Val Gly He Arg Tyr Phe Lvs Val Phe Asn Thr 
10^0 1065 * 1070 

Glu Leu Asp Lys Thr Glu lie Glu Thr Leu Tyr Ser Asn Glu Pro Asp 
1075 1080 1085 

Pr ° ]nqo Ile LyS ASn TYr Tl ' P ° ly ASn Tyr Leu Leu T V r Asn L Y* 



40 



45 



1100 



1090 1095 

Lys Tyr Tyr Leu Phe Asn Leu Leu Arg Lys Asp Lys Tyr lie Thr Leu 
1105 i110 1115 U20 

Asn Ser Gly He Leu Asn He Asn Gin Gin Arg Glv Val Thr Glu Gly 
H25 H30 ' 1135 

Ser Val Phe Leu Asn Tyr Lys Leu Tyr Glu Gly Val Glu Val lie He 
H40 H45 1150 

Arg Lys Asn Gly Pro lie Asp lie Ser Asn Thr Asp Asn Phe Val Arq 
il5S H60 H65 

Lys Asn Asp Leu Ala Tyr lie Asn Val Val Asp Arg Gly Val Glu Tyr 
■ W 1170 1175 ileo 

Arg Leu Tyr Ala Asp Thr Lys Ser Glu Lys Glu Lys He He Arq Thr 
1185 I 190 1195 " 1200 

:0 ser Asn Leu Asn Asp Ser Leu Gly Gin lie lie Val Met Asp Ser lie 

!205 i 2 io 12 15 



AO 



65 



70 



Gly 


Asn Asn Cys Thr 
1220 


Met 


Asn Phe Gin Asn 

1225 


Asn 


Asn 


Gly 


Ser Asn 
1230 


He 


Gly 


Leu Leu Gly 
1235 


Phe 


His 


Ser Asn Asn Leu 
1240 


Val 


Ala 


Ser 
1245 


Ser 


Trp 


Tyr 


Tyr 


Asn Asn lie 
1250 


Arg 


Arg 


Asn Thr Ser Ser 
1255 


Asn 


Gly 
1260 


Cys 


Phe 


Trp 


Ser 


Ser 
1265 


He Ser Lys 


Glu 


Asn 

1270 


Gly Trp Lys Glu 

i 














{2) 


INFORMATION 


FOR 


SEQ 


ID NO : 7 2 : 
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45 



3D 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1460 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(ii> MOLECULE TYPE : DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 108.. 1451 



TAT AAA AAA ATT AAA GAT AGT TCT ATT TTA GAT ATG CGA TAT GAA AAT 
Tyr Lys Lys lie Lys Asp Ser Ser lie Leu Asp SI? Arg Tyr £n 

AAT AAA TTT ATA GAT ATC TCT GGA TAT GGT TCA AAT ATA AGC ATT AAT 
Asn Lys Ph . ue Asp lie Ser Gly Tyr Gly Ser A^n lie Sor Til JJJ 
55 60 65 

GGA AAC GTA TAT ATT TAT TCA ACA AAT AGA AAT CAA TTT GGA ATA TAT 
Gly Asn Val Tyr He Tyr Ser Thr Asn Arg Asn Gly lie lyr 

40 75 80 

AAT AGT AGG CTT AGT GAA GTT AAT ATA GCT CAA AAT AAT GAT ATT ATA 
Asn ser Arg Leu Ser Glu Vol Asn ii. Ala Gin Asn Asp ill lie 



95 



TAC AAT AGT AGA TAT CAA AAT TTT AGT ATT AGT TTC TGG GTA AGG ATT 
ryr Asn Ser Arg Tyr Gin Asn Phe Ser He Ser Phe Trp vll Arg Til 
105 J-10 115 



130 



AAT TGT ATG GGG AAT AAT AAT TCG GGA TGG AAA ATA TCA CTT AGA ACT 
Asn Cys Met Gly Asn Asn Asn Ser Gly Trp Lys lie Ser Leu Arg Thr 
135 140 145 

uIT ^ AT TGT GAA ATA ATT TGG ACT TTA CAA GAT ACT TCT GGA AAT 

val Arg Asp Cys Glu lie lie Trp Thr Leu Gin Asp Thr ler Gly US 

60 155 160 

AAG GAA AAT TTA ATT TTT AGG TAT GAA GAA CTT AAT AGG ATA TCT AAT 
Lys Glu Asn Leu He Phe Arg Tyr Glu Glu Leu Asn Arg lie III HI 



175 



TAT ATA AAT AAA TGG ATT TTT GTA ACT ATT ACT AAT AAT AGA TTA err 
Tyr lie Asn Lys Trp lie Phe Val Thr He Thr j£J %l Arg lIu Gly 

185 190 19 s 



60 

116 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
15 AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 

TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT 

Met Gly His 

20 i 

si si si si s: si si si ss s s ^ « s - - ... 

55 £ SI S S K 2? S K 2 S S ?» £ S S E =» 

" 5 30 3 5 



260 



308 



356 



404 



452 



5» S £ Si £ £ E K S SI 12 £ "K £ S JS JK 500 

120 125 



548 



596 



644 



692 



70 ^ T I CT AGA ATT TAC ATC AAT GGA AAT TTA ATA GTT GAA AAA TCA ATT 

™ Asn Ser Arg He Tyr lie Asn Gly Asn Leu lie Val g?5 JJi Ser ill *° 
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30 



40 



45 



50 



:>:> 



788 



836 



884 



932 



200 205 210 

TCG AAT TTA GGT GAT ATT CAT GTT AGT GAT AAT ATA TTA TTT AAA ATT 
Ser Asn Leu Gly Asp He His Val Ser Asp Asn He Leu Phe Lys He 
215 220 225 

GTT GGT TGT GAT GAT GAA ACG TAT GTT GGT ATA AGA TAT TTT AAA GTT 
Val Gly Cys Asp Asp Glu Thr Tyr Val Gly He Arg Tyr Phe Lys Val 
230 235 240 

TTT AAT ACG GAA TTA GAT AAA ACA GAA ATT GAG ACT TTA TAT AGT AAT 
Phe Asn Thr Glu Leu Asp Lys Thr Glu He Glu Thr Leu Tyr Ser Asn 
245 250 255 

b GAG CCA GAT CCA AGT ATC TTA AAA AAC TAT TGG GGA AAT TAT TTG CTA 

Glu Pro Asp Pro Ser He Leu Lys Asn Tyr Trp Gly Asn Tyr Leu Leu 
260 265 270 275 

TAT AAT AAA AAA TAT TAT TTA TTC AAT TTA CTA AGA AAA GAT AAG TAT 980 
~U Tyr Asn Lys Lys Tyr Tyr Leu Phe Asn Leu Leu Arg Lys Asp Lys Tvr 

280 285- 290 

ATT ACT CTG AAT TCA GGC ATT TTA AAT ATT AAT CAA CAA AGA GGT GTT 
He Thr Leu Asn Ser Gly He Leu Asn He Asn Gin Gin Arg Gly Val 
295 300 305 

ACT GAA GGC TCT GTT TTT TTG AAC TAT AAA TTA TAT GAA GGA GTA GAA 
...r Glu Gly Ser Val Phe Leu Asn Tyr Lys Leu Tyr Glu Glv Val Glu 
310 315 320 

GTC ATT ATA AGA AAA AAT GGT CCT ATA GAT ATA TCT AAT ACA GAT AAT 
Vai Ile Ile Ar 9 L YS Asn Gly Pro He Asp He Ser Asn Thr Asp Asn 
325 3 30 335 

TTT GTT AGA AAA AAC GAT CTA GCA TAC ATT AAT GTA GTA GAT CGT GGT 
Phe Val Arg Lys Asn Asp Leu Ala Tyr Ile Asn Val Val Asp Arq Glv 
340 345 350 355 

GTA GAA TAT CGG TTA TAT GCT GAT ACA AAA TCA GAG AAA GAG AAA ATA 1220 
;al Glu Tyr Arg Leu Tyr Ala Asp Thr Lys Ser Glu Lvs Glu Lys lie 
360 365 * 370 

ATA AGA ACA TCT AAT CTA AAC GAT AGC TTA GGT CAA ATT ATA GTT ATG 
^ie Arg Thr Ser Asn Leu Asn Asp Ser Leu Gly Gin He Ile Val Met 
375 380 385 

GAT TCA ATA GGA AAT AAT TGC ACA ATG AAT TTT CAA AAC AAT AAT GGG 
Asp Ser Ile Gly Asn Asn Cys Thr Met Asn Phe Gin Asn Asn Asn Gly 
390 395 400 

AGC AAT ATA GGA TTA CTA GGT TTT CAT TCA AAT AAT TTG GTT GCT AGT 13 64 

Ser Asn He Gly Leu Leu Gly Phe His Ser Asn Asn Leu Val Ala Ser 
405 410 4 i5 

AGT TGG TAT TAT AAC AAT ATA CGA AGA AAT ACT AGC AGT AAT GGA TGC 1412 
Ser lrp Tyr Tyr Asn Asn Ile Arg Arg Asn Thr Ser Ser Asn Gly Cys 
420 425 430 435 

TTT TGG AGT TCT ATT TCT AAA GAG AAT GGA TGG AAA GAA TGAAAGCTT 14 6 0 

C>0 Phe Trp Ser Ser He Ser Lys Glu Asn Gly Trp Lys Glu 

440 445 

(2) INFORMATION FOR SEQ ID NO: 73: 

^° <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 8 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 



1028 



1076 



1124 



1172 



1268 



1316 



70 



(ii) MOLECULE TYPE : protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: 

Met Gly His His His His His His His His His His Ser Ser Gly His 
5 10 



15 



He Glu Gly Arg His Met Ala Ser Met Ala lie Leu lie lie Tyr Phe 
20 2 5 30 

Asn Arg Leu Tyr Lys Lys lie Lys Asp Ser Ser lie Leu Asp Met Arg 
Jb 40 45 

Tyr Glu Asn Asn Lys Phe lie Asp lie Ser Gly Tyr Gly Ser Asn He 

55 60 
Ser lie Asn Gly Asn Val Tyr lie Tyr Ser Thr Asn Arg Asn Gin Phe 

75 80 
Gly He Tyr Asn Ser Arg Leu Ser Glu Val Asn He Ala Gin Asn Asn 

85 90 95 

Asp He lie Tyr Asn Ser Arg Tyr Gin Asn Phe Ser He Ser Phe Trp 

Val Arg He Pro Lys His Tyr Lys Pro Met Asn His Asn Arg Glu Tyr 

Thr lie He Asn Cys Met Gly Asn Asn Asn Ser Gly Trp Lys lie Ser 



140 



Leu Arg Thr Val Arg Asp Cys Glu He He Trp Thr Leu Gin Asp Thr 

155 160 
Ser Gly Asn Lys Glu Asn Leu He Phe Arg Tyr Glu Glu Leu Asn Arg 



155 170 175 



He Ser Asn Tyr He Asn Lys Trp He Phe val Thr lie Thr Asn Asn 
180 I" lpo 

Arg Leu Gly Asn Ser Arg He Tyr lie Asn Gly Asn Leu lie Val Glu 

195 20° 205 

Lys Ser He Ser Asn Leu Gly Asp He His Val Ser Asp Asn He Leu 



0 



Phe Lys lie Val Gly Cys Asp Asp Glu Thr Tyr Val Gly He Arg Tyr 
230 "5 240 

Phe Lys Val Phe Asn Thr Glu Leu Asp Lys Thr Glu He Glu Thr Leu 

245 250 255 

Tyr Ser Asn Glu Pro Asp Pro Ser He Leu Lys Asn Tyr Trp Gly Asn 

60 265 270 

Tyr Leu Leu Tyr Asn Lys Lys Tyr Tyr Leu Phe Asn Leu Leu Arg Lys 



285 



Asp Lys Tyr He Thr Leu Asn Ser Gly lie Leu Asn He Asn Gin Gin 
30 295 -joe 

Arg Gly Val Thr Glu Gly Ser Val Phe Leu Asn Tyr Lys Leu Tyr Glu 

310 315 320 

Gly Val Glu Val lie He Arg Lys Asn Gly Pro He Aso He Ser Asn 

325 "0 " 335 

Thr Asp Asn Phe Val Arg Lys Asn Asp Leu Ala Tyr He Asn Val Val 

345 350 

Asp Arg Gly Val Glu Tyr Arg Leu Tyr Ala Asp Thr Lys Ser Glu Lys 
33 36 0 365 
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Glu Lys lie He Arg Thr Ser Asn Leu Asn Asp Ser Leu Gly Gin He 
370 375 380 

He Val Met Asp Ser He Gly Asn Asn Cys Thr Met Asn Phe Gin Asn 
385 390 395 400 

Asn Asn Gly Ser Asn He Gly Leu Leu Gly Phe His Ser Asn Asn Leu 
405 410 415 

Val Ala Ser Ser Trp Tyr Tyr Asn Asn He Arg Arg Asn Thr Ser Ser 
420 425 430 

Asn Gly Cys Phe Trp Ser Ser He Ser Lys Glu Asn Gly Trp Lys Glu 
435 440 445 

(2) INFORMATION FOR SEQ ID NO:74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

SEQUENCE DESCRIPTION: SEQ ID NO:74: 

CGCCATGGCT ATTCTAATTA TATATTTTAA TAG 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: other nucleic acid 
\A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 75: 

GCAAGCTTTC ATTCTTTCCA TCCATTCTC 

>2) INFORMATION FOR SEQ ID NO : 76 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1 . . 3891 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

ATG CCA GTT AAT ATA AAA AAC TTT AAT TAT AAT GAC CCT ATT AAT AAT 
Met Pro Val Asn He Lys Asn Phe Asn Tyr Asn Asp Pro lie Asn Asn 
1 5 io 



15 



GAT GAC ATT ATT ATG ATG GAA CCA TTC AAT GAC CCA GGG CCA GGA ACA 
Asp Asp He He Met Met Glu Pro Phe Asn Asp Pro Gly Pro Civ Thr 
20 25 30 

TAT TAT AAA GCT TTT AGG ATT ATA GAT CGT ATT TGG ATA GTA CCA GAA 
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Tyr Tyr Lys Ala Phe Arg lie lie Asp Arg He Trp U e Val Pro Glu 
■ J3 40 



45 



Ar? Phe tS JS G?v Phi S" GAC CAA TTT ^ GCC AGT ACA «» 

Hrq Fne Tnr Tyr Gly Phe Gin Pro Asp Gin Phe Asn Ala Ser Thr Gly 

55 60 

GTT TTT ACT AAA GAT GTC TAC GAA TAT TAC GAT CCA ACT TAT TTA AAA 
Val Phe Ser Lys Asp Val Tyr Glu Tyr Tyr Asp Pro Thr Tyr Leu Lys 



75 



80 



tS Asn tfl GAT TTT TTA ACA A TG ATT AAA TJA TTT 

Thi Asp Ala Glu Lys Asp Lys Phe Leu Lys Thr Met lie Lys Leu HI 

8 5 ~ ~ 



90 



95 



AAT AGA ATT AAT TCA AAA CCA TCA GGA CAG AGA TTA CTG GAT ATG ATA 
Asn Arg lie Asn Ser Lys P ro Ser Gly Gin Arg Leu lIu AsJ Tie 



110 



GTA GAT GCT ATA CCT TAT CTT GGA AAT GCA TCT ACA CCG CCC rzr aaa 
Val Asp Ala He Pro Tyr Leu Gly Asn Ala lei Thr Pro £o Asp L^ 

120 125 

Phi ^ GT T ° CA AAT GTA TCT ATT AAT AAA AAA ATT ATC CAA 

Phe Ala Ala Asn Val Ala Asn Val Ser lie Asn Lys ™ ill !J §fj 

135 140 

SI gIC All G^ Asl SS n A ?** G -? T TTA ATG ACA TTA ATA ATA 

U5 y Glu Asp G £ n Ile L V S G1 V Leu Met Thr Asn Leu He He 

155 160 

HI Gly Pro Pro SIT J™ f T GAT TTT ACT GAT AGT ATG ATT 

*-ne uiy Pro Gly Pro Val Leu Ser Asp Asn Phe Thr Asp Ser Met Ile 

165 170 175 

£« All Glv Hxl £ C P CA tT A GAA GGA TTT GGT GCA AGA A™ ATG 

Met Aon uly His Sei Pro Ile Ser Glu Gly Phe Gly Ala Arg Met Met 

180 1B5 190 

ATA AGA TTT TGT CCT AGT TGT TTA AAT GTA TTT AAT AAT GTT CAG GAA 
He Arg Phe Cys Pro Ser Cys Leu Asn Val Phe Asn Asl vIT G^n Si 

200 205 

A^n T T ^ A TCT A ™ TTT AGT AGA CGC GCG TAT TTT GCA GAT CCA 

Asn Lys Asp Thr Ser lie Phe Ser Arg Arg Ala Tyr Phe Ala Asp Pro 

GCT CTA ACG TTA ATG CAT GAA CTT ATA CAT GTG TTA CAT GGA TTA TAT 
Ala Leu Thr Leu Met His Glu Leu lie His Val Leu nil Giy leu £yr 

230 235 240 

GlC tTI f AG tT A AGT W TTA CCA ATT ACT CCA AAT ACA AAA GAA TTT 
Gly lie Lys He Ser Asn Leu Pro Ile Thr Pro Asn Thr Lys Glu Phe 

245 250 255 

Phe Mlt ml f C ^ AT CCT GTA CAA GCA GAA GAA C ™ TAT ACA TTC 

Phe Met Gin His Ser Asp Pro Val Gin Ala Glu Glu Leu Tvr Thr Phe 

260 265 2 70 

GW Clt m T n CT AGT GTT ATA AGT CCT TCT ACG GAT ATG AAT ATT 

Gly Gly His Asp Pro Ser Val lie Ser Pro Ser Thr Asp Met Asn lie 
" 1S 280 285 



£n ^ G TTA CAA AAT TTT CAA GAT ATA GCT *™ AGG CTT AAT 

Tyi Asn Lys Ala Leu Gin Asn Phe Gin Asp He Ala Asn Arg Leu Asn 



300 



ATT GTT TCA AGT GCC CAA GGG AGT GGA ATT GAT ATT TCC TTA TAT AAA 
o Val Ser Ser Ala Gin Gly Ser Gly lie Asp lie Ter "u Tyr SJi 
310 315 3^ 0 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



67; 



720 



768 



816 



864 



912 



960 
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CAA ATA TAT AAA AAT AAA TAT GAT TTT GTT GAA GAT CCT AAT GGA AAA 1008 

Gin lie Tyr Lys Asn Lys Tyr Asp Phe Val Glu Asp Pro Asn Gly Lys 
325 330 335 

5 TAT AGT GTA GAT AAG GAT AAG TTT GAT AAA TTA TAT AAG GCC TTA ATG 10 56 

Tyr Ser Val Asp Lys Asp Lys Phe Asp Lys Leu Tyr Lys Ala Leu Met 
340 345 350 

TTT GGC TTT ACT GAA ACT AAT CTA GCT GGT GAA TAT GGA ATA AAA ACT H04 

l» Phe Gly Phe Thr Glu Thr Asn Leu Ala Gly Glu Tyr Gly lie Lys Thr 

355 360 365 

AGG TAT TCT TAT TTT AGT GAA TAT TTG CCA CCG ATA AAA ACT GAA AAA 

Arg Tyr Ser Tyr Phe Ser Glu Tyr Leu Pro Pro He Lys Thr Glu Lys 
I -> 370 375 - ~ - 7 



20 



30 



40 



60 



65 



380 



TAT TTA CAT GCT CAA ACA TTT CCT TCT AAT ATA GAA AAT CTA CAA CTA 

Tyr Leu His Ala Gin Thr Phe Pro Ser Asn He Glu Asn Leu Gin Leu 

54B 550 555 560 

ACG AAT TCA TTA AAT GAT GCT TTA AGA AAT AAT AAT AAA GTC TAT ACT 

Thr Asn Ser Leu Asn Asp Ala Leu Arg Asn Asn Asn Lys Val Tyr Thr 

565 570 575 

TTT TTT TCT ACA AAC CTT GTT GAA AAA GCT AAT ACA GTT GTA GGT GCT 

Phe Phe Ser Thr Asn Leu Val Glu Lys Ala Asn Thr Val Val Glv Ala 

580 585 590 



1152 



1200 



1248 



TTG TTA GAC AAT ACA ATT TAT ACT CAA AAT GAA GGC TTT AAC ATA GCT 
Leu Leu Asp Asn Thr He Tyr Thr Gin Asn Glu Gly Phe Asn lie Ala 
385 390 395 400 

AGT AAA AAT CTC AAA ACG GAA TTT AAT GGT CAG AAT AAG GCG GTA AAT 
Ser Lys Asn Leu Lys Thr Glu Phe Asn Gly Gin Asn Lys Ala Val Asn 
405 410 415 

25 AAA GAG GCT TAT GAA GAA ATC AGC CTA GAA CAT CTC GTT ATA TAT AGA 12 96 

Lys Glu Ala Tyr Glu Glu lie Ser Leu Glu His Leu Val lie Tyr Arq 
420 425 430 

ATA GCA ATG TGC AAG CCT GTA ATG TAC AAA AAT ACC GGT AAA TCT GAA 1344 
lie Ala Met Cys Lys Pro Val Met Tyr Lys Asn Thr Gly Lys Ser Glu 
435 440 445 

CAG TGT ATT ATT GTT AAT AAT GAG GAT TTA TTT TTC ATA GCT AAT AAA 13 5^ 

Gin Cys lie He Val Asn Asn Glu Asp Leu Phe Phe He Ala Asn Lys 
j:> 4 50 455 460 

GAT AGT TTT TCA AAA GAT TTA GCT AAA GCA GAA ACT ATA GCA TAT AAT 14 4 0 

Asp Ser Phe Ser Lys Asp Leu Ala Lys Ala Glu Thr He Ala Tyr Asn 
465 470 475 480 

ACA CAA AAT AAT ACT ATA GAA AAT AAT TTT TCT ATA GAT CAG TTG ATT 14 88 

Thr Gin Asn Asn Thr lie Glu Asn Asn Phe Ser He Asp Gin Leu He 
485 490 495 

4? TTA GAT AAT GAT TTA AGC AGT GGC ATA GAC TTA CCA AAT GAA AAC ACA 1536 

i,eu Asp Asn Asp Leu Ser Ser Gly He Asp Leu Pro Asn Glu Asn Thr 
500 505 510 

GAA CCA TTT ACA AAT TTT GAC GAC ATA GAT ATC CCT GTG TAT ATT AAA 15 84 

3U Glu Pro Phe Thr Asn Phe Asp Asp lie Asp He Pro Val Tyr He Lys 
515 520 525 

CAA TCT GCT TTA AAA AAA ATT TTT GTG GAT GGA GAT AGC CTT TTT GAA 16 3 ^ 

Gin Ser Ala Leu Lys Lys He Phe Val Asp Gly Asp Ser Leu Phe Glu 
D:> 530 535 540 



1680 



1728 



1776 



TCA CTT TTT GTA AAC TGG GTA AAA GGA GTA ATA GAT GAT TTT ACA TCT 18 24 

Ser Leu Phe Val Asn Trp Val Lys Gly Val He Asp Asp Phe Thr Ser 
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15 



35 



65 



S9 5 600 



605 



rtf I CC tu h CAA ^ AGT ACT ATA GAT ^ GTT TCA GAT GTA TCC ATA 
Glu Ser Thr Gin Lys Ser Thr lie Asp Lys Val llr j£J !2 Ser lie 

615 



620 



650 655 



ss ss si s s s: s s s s s sir s; - jz s 

66 5 



670 



40 A^n tT T ^ C ^T T GAT TTT *»* GAT ATA GAT TTT AAA CTT AAT CAA AGT 

Asn Ue Asn He Asp Phe Asn Asp lie Asp Phe Lys Leu £sn Gin Ser 

760 7 6 5 

ATA AAT TTA GCA ATA AAC AAT ATA GAT GAT TTT ATA AAC CAA TGT rrr 
45 116 To LSU Ala Ue Asn Asn Ile AS P As P ^ lie JSS gK £J I" 

lie Kr lIu mI? f T f AT ° ATT CCA TTA GCT GTA AAA AAG TTA 

lie Ser Tyr Leu Met Asn Arg Met He Pro Leu Ala Val Lys Lys Leu 

50 790 7 95 8oo 

AAA GAC TTT GAT GAT AAT CTT AAG AGA GAT TTA TTG GAG TAT ATA TftT 
Lys Asp Phe Asp Asp Asn Leu Lys Arg Asp Leu 22 Glu lyl lie AsJ 



810 



815 



Thr l M TTA CTT GAT GAA GTA AAT ATT CTA AAA TCA AAA 

Thr Asn Glu Leu Tyr Leu Leu Asp Glu Val Asn lie Leu Lys Ser L^ 
820 825 830 



845 



Thr w° ^ AC ATT TTA ATA CAA GTT TTT AAT AAT TAT ATT AGT AAT 

Thr Lys Asp Thr lie Leu lie Gin Val Phe Asn Asn Tyr ill Sel A^n 

o b 5 



860 



ATT AGT AGT AAT GCT ATT TTA AGT TTA AGT TAT AGA GGT GGG CGT TTA 
le Ser Ser Asn Ala lie Leu Ser Leu Ser Tyr Arg Sy Gly Arg lIu 
70 870 875 880 



1872 



lie ill Pro Tvl tie r? A E" ST ^ MT GTA GGA AAT GAA ACA GCT 
lie He Pro Tyr He Gly Pro Ala Leu Asn Val Gly Asn Glu Thr Ala 

10 630 635 640 

AAA GAA AAT TTT AAA AAT GCT TTT GAA ATA fil-T rr» r-rr r-^ 
Lys Glu Asn Phe Lys Asn Ala Phi Glu S Gly £J J£ Ala" J£ ITu 



2016 



» L™ £S E K S Sf SI £^ £ SI SI ill SI? S S S 

680 685 

AAT GCT TTA AAG AAA AGG GAT CAA AAA TGG ACA GAT ATG TAT GGT TTC 
25 AS " A go L€U LyS Lys Ar * AS Q ? Gln ^ Trp Thr As? Me? Tyl g?v IS 

595 700 

iK S K SS S 2S S S SI 2 E HI SJ ?s JK 

E s; is s s i™ - - ~ - a: s s si 

725 730 735 

AAA ATA ATA GAA GAT CAA TAT AAT AGA TAT AGT GAA GAA GAT AAA ATG 
Lys He He Glu Asp Gin Tyr Asn Arg Tyr Ser Glu cfl AsJ £yt S« 

745 7 50 



3064 



112 



160 



2256 



2304 



2352 



2400 



2446 



2496 



<-o s is ^ ss s s s e s: s e si si e 21 £ »« 



2592 



2640 
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rl* it I I CT GGA TAT GGT GCA ACT ATG AAT °TA GGT TCA GAT GTT 
lie Asp Ser Ser Gly Tyr Gly Ala Thr Met Asn Val Gly Ser Asp vll 
885 690 g 9 | 

3 ATC TTT AAT GAT ATA GGA AAT GGT CAA TTT AAA TTA AAT AAT TCT GAA 

He Phe Asn Asp He Gly Asn Gly Gin Phe Lys Ju A^n A^n Ser Glu 
900 905 910 

10 AAT o GT AAT ATT ACG GCA CAT CAA ACT AAA TTC GTT GTA TAT GAT ACT 

"0 Asn ser Asn lie Thr Ala His Gin Ser Lys Phe vll £f £J Ser 

920 925 

Mir Phi AAT T 3 T AGC ATT AAC TGG GTA AGG ACT CCT AAA TAT 

15 "Jo P ASn Phe Tr P Val Arg '^r Pro Lys 

935 940 

£1 £1 GAT A T A CAA ACT TAT CTT CAA AAT GAG TAT ACA ATA ATT 

Asn Asn Asn Asp lie Gin Thr Tyr Leu Gin Asn Glu Tyr Thr S ul 

Ser £J n A ^ T GAC T A GGA TGG *** GTA TCT ATT AAG GGA AAT 

Ser Cy S lie Lys Asn Asp Ser Gly Trp Lys Val Ser lie Lys Gly ten 

yz>5 970 



45 



975 



Arg iTe ^ ™ A ™ GAT GTT AAT GCA AAA TCT AAA TCA ATA 

9fln ASP Val Asn Ala L V 5 Ser Ser He 

?ou 985 



990 



Phi Si S* ^r Ser lie *** a" *** ATA TCA GAT TAT ATA AAT AAA 

e yne Glu Tyr Ser lie Lys Asp Asn He Ser Asp Tyr lie Asn Ly< 

" 3 1000 ' 



loo;; 



S sir uJ £ nl f T GAT AGA TTA GGT AAC GCA *AT ATT 

•"'5 1010 Tnfc A8n ASP Al ' 9 Leu G1 y Asn Ala Asn lie 

1015 1020 

Tyl lie £1 G?C cf T G ?** AGT GAA ^ ATT TTA AAC TTA GAT 

Tyr lie Asn Gly Ser Leu Lys Lys Ser Glu Lys He Leu Asn Leu Asp 

40 1030 1035 10 £ 0 



a™ tT T AAT TCT AGT ^ GAT ATA GAC TTC AAA TTA ATT AAT TCT /pa 
A, 9 He Asn ser ser Asn Asp lie Asp Phe Lys lIu ill £1 A £ 

1045 "50 1055 

GAT ACT ACT AAA TTT GTT TGG ATT AAG GAT TTT AAT ATT TTT rrr ,r. 
~sp Tnr Thr Lys Phe Val Trp He Lys Asp pII £1 Til III oil Arg 
1060 1065 1070 " J 



Phi AG J *** ^ T TCT ATG GGG GAA ACT GCA CCA CGT ACA AAC TTT AAT 
Phe ser Lys Ala Ser Met Gly Glu Thr Ala Pro Arg Thr ten Phi tel 
1125 1130 



2688 



2736 



2784 



283: 



2880 



2928 



2976 



3024 



3072 



3120 



3168 



3216 



50 lIu t^ ^ G It 153 ™ f TA IAT JOG ATT CAA TCA TCT 



3312 



Olu ^ A,„ M a Thr Blu val ™ - - - ;J J - ™ «T 

1080 1085 

M s ™ - s - - - - - - 

Si,S £ 2J S !S St S2 Si 2£ ™ ™ S ?S »» 

60 1110 "15 ' 1120 



3408 



1135 
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30 



40 



45 



50 



60 



65 



70 



AAT GCA GCA ATA AAT TAT CAA AAT TTA TAT CTT GGT TTA CGA TTT ATT 
Asn Ala Ala He Asn Tyr Gin Asn Leu Tyr Leu Gly Leu Arg Phe lie 
1140 H45 1150 

ATA AAA AAA GCA TCA AAT TCT CGG AAT ATA AAT AAT GAT AAT ATA GTC 
lie Lys Lys Ala Ser Asn Ser Arg Asn He Asn Asn Asp Asn He Val 
1155 H60 lies 



GAA TAA 
Glu 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1297 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Met Pro Val Asn He Lys Asn Phe Asn Tyr Asn Asp Pro lie Asn Asn 
1 5 io 



15 



Asp Asp He He Met Met Glu Pro Phe Asn Asp Pro Gly Pro Gly Thr 

20 25 30 

Tyr Tyr Lys Ala Phe Arg He He Asp Arg He Trp He Val Pro Glu 

35 40 45 

Arg Phe Thr Tyr Gly Phe Gin Pro Asp Gin Phe Asn Ala Ser Thr Glv 

50 55 60 7 

Val Phe Ser Lys Asp Val Tyr Glu Tyr Tyr Asp Pro Thr Tyr Leu Lys 

65 70 75 bo 

Thr Asp Ala Glu Lys Asp Lys Phe Leu Lys Thr Met He Lys Leu Phe 

- 372 - 



3456 



3504 



10 ^ 2?* GAT TAT ATA TAT ° TT ATT GAT ^ ATT T CT GAT GAA 3 5 52 

10 Arg Glu Gly Asp Tyr He Tyr Leu Asn lie Asp Asn He Ser Asp Glu 
1170 H75 11Q0 

TCT TAC AGA GTA TAT GTT TTG GTG AAT TCT AAA GAA ATT CAA ACT CAA 
Ser Tyr Arg Val Tyr Val Leu Val Asn Ser Lys Glu He Gin Thr Gin 
n 1185 H90 1195 1200 

TTA TTT TTA GCA CCC ATA AAT GAT GAT CCT ACG TTC TAT GAT GTA CTA 
Leu Phe Leu Ala Pro He Asn Asp Asp Pro Thr Phe Tyr Asp Val Leu 
20 1205 1210 1215 

CAA ATA AAA AAA TAT TAT GAA AAA ACA ACA TAT AAT TGT CAG ATA CTT 
Gin lie Lys Lys Tyr Tyr Glu Lys Thr Thr Tyr Asn Cys Gin He Leu 
1220 1225 1230 

25 TGC GAA AAA GAT ACT AAA ACA TTT GGG CTG TTT GGA ATT GGT AAA TTT 

Cys Glu Lys Asp Thr Lys Thr Phe Gly Leu Phe Gly He Gly Lys Phe 
1235 1240 1245 

GTT AAA GAT TAT GGA TAT GTT TGG GAT ACC TAT GAT AAT TAT TTT TGC 3 792 

Val Lys Asp Tyr Gly Tyr Val Trp Asp Thr Tyr Asp Asn Tyr Phe Cys 
1-50 1255 1260 

ATA ACT CAG TGG TAT CTC AGA AGA ATA TCT GAA AAT ATA AAT AAA TTA 3 84 0 

IS ,1^ ln Trp Tyr Leu Arg Ar 9 Ile Ser Glu Asn He Asn Lys Leu 

J " 1265 1270 1275 1280 

AGG TTG GGA TGT AAT TGG CAA TTC ATT CCC GTG GAT GAA GGA TGG ACA 3 88 8 

Arg Leu Gly Cys Asn Trp Gin Phe Ile Pro Val Asp Glu Gly Trp Thr 
I 265 1290 1295 



3600 



3648 



3696 



3744 



3894 
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85 90 , 95 

Asn Arg He Asn Ser Lys Pro Ser Gly Gin Arg Leu Leu Asp Met lie 
- 100 105 no 

Val Asp Ala He Pro Tyr Leu Gly Asn Ala Ser Thr Pro Pro Asp Lys 
H5 120 125 

Phe Ala Ala Asn Val Ala Asn Val Ser He Asn Lys Lys He He Gin 
lu 130 135 140 

Pro Gly Ala Glu Asp Gin He Lys Gly Leu Met Thr Asn Leu He He 
145 150 155 i 6 o 

Phe Gly Pro Gly Pro Val Leu Ser Asp Asn Phe Thr Asp Ser Met He 
165 170 175 

Met Asn Gly His Ser Pro He Ser Glu Gly Phe Gly Ala Arg Met Met 
180 185 190 

He Arg Phe Cys Pro Ser Cys Leu Asn Val Phe Asn Asn Val Gin Glu 
i95 200 205 

Asn Lys Asp Thr Ser He Phe Ser Arg Arg Ala Tyr Phe Ala Asp Pro 
210 215 220 

Ala Leu Thr Leu Met His Glu Leu He His Val Leu His Gly Leu Tvr 
225 23 0 235 2^0 

Gly He Lys He Ser Asn Leu Pro He Thr Pro Asn Thr Lys Glu Phe 
245 250 255 

Phe Met Gin His Ser Asp Pro Val Gin Ala Glu Glu Leu Tyr Thr Phe 
260 265 270 

Gly Gly His Asp Pro Ser Val He Ser Pro Ser Thr Asp Met Asn He 
275 280 285 

d(\ Tyr ^ n LyS Ala Leu Gln Asn Phe Gln As P Ile A la Asn Arg Leu Asn 

4U 290 295 300 



15 



20 



30 



3:> 



45 



50 



He Val Ser Ser Ala Gln Gly Ser Gly He Asd Ile Ser Leu Tyr Lys 
35 310 315 320 

Gln He Tyr Lys Asn Lys Tyr Asp Phe Val Glu Asp Pro Asn Gly Lys 
325 330 335 

Tyr Ser Val Asp Lys Asp Lys Phe Asp Lys Leu Tyr Lys Ala Leu Met 
340 345 350 

Phe Gly Phe Thr Glu Thr Asn Leu Ala Gly Glu Tyr Gly He Lys Thr 
355 360 365 

S5 ^ T?n S<2r PhS GlU Tyr LeU Pr ° Pr ° Ile ^ Thr Glu Lys 

370 375 3eo 

Leu Leu Asp Asn Thr He Tyr Thr Gln Asn Glu Gly Phe Asn Ile Ala 
385 390 395 400 

Ser Lys Asn Leu Lys Thr Glu Phe Asn Gly Gln Asn Lys Ala Val Asn 
405 410 415 

Lys Glu Ala Tyr Glu Glu He Ser Leu Glu His Leu Val Ile Tyr Ara 

65 420 425 430 

He Ala Met Cys Lys Pro Val Met Tyr Lys Asn Thr Gly Lys Ser Glu 
435 440 445 



60 



70 ^ Ile IlG Val Asn AGn Glu As P Leu phe Ph « He Ala Asn Lys 

/U 450 455 46O 
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Asp Ser Phe Ser Lys Asp Leu Ala Lys Ala Glu Thr lie Ala Tyr Asn 

470 475 480 

Thr Gin Asn Asn Thr He Glu Asn Asn Phe Ser U e Asp Gin Leu He 
48S 490 495 

Leu Asp Asn Asp Leu Ser Ser Gly He Asp Leu Pro Asn Glu Asn Thr 
500 505 510 

Glu Pro Phe Thr Asn Phe Asp Asp lie Asp lie Pro Val Tyr He Lys 

Gin Ser Ala Leu Lys Lys He Phe Val Asp Gly Asp Ser Leu Phe Glu 

535 54 0 

Tyr Leu His Ala Gin Thr Phe Pro Ser Asn He Glu Asn Leu Gin Leu 

550 555 5G0 

Thr Asn Ser Leu Asn Asp Ala Leu Arg Asn Asn Asn Lys Val Tyr Thr 
565 570 575 

Phe Phe Ser Thr Asn Leu Val Glu Lys Ala Asn Thr Val Val Gly Ala 
580 585 590 

Ser Leu Phe Val Asn Trp Val Lys Gly Val He Asp Asp Phe Thr Ser 
595 6 oo 605 

Glu Ser Thr Gin Lys Ser Thr He Asp Lys Val Ser Asp Val Ser lie 

lie lie Pro Tyr He Gly Pro Ala Leu Asn Val Gly Asn Glu Thr Ala 
630 635 640 

Lys Glu Asn Phe Lys Asn Ala Phe Glu He Gly Gly Ala Ala He Leu 
645 6 50 655 

Met Glu Phe lie Pro Glu Leu He Val Pro He Val Gly Phe Phe Thr 
660 665 670 

Leu Glu ser Tyr Val Gly Asn Lys Gly His He He Met Thr He Ser 
675 680 6 85 

Asn Ala Leu Lys Lys Arg Asp Gin Lys Trp Thr Asp Met Tyr Gly Leu 

695 7 00 

He Val ser Gin Trp Leu Ser Thr Val Asn Thr Gin Phe Tyr Thr He 

710 715 720 

Lys Glu Arg Met Tyr Asn Ala Leu Asn Asn Gin Ser Gin Ala He Glu 
725 730 735 

Lys He He Glu Asp Gin Tyr Asn Arg Tyr Ser Glu Glu Asp Lys Met 
740 745 750 

Asn He Asn He Asp Phe Asn Asp He Asp Phe Lys Leu Asn Gin Ser 
7 55 760 765 

lie Asn Leu Ala He Asn Asn He Asp Asp Phe lie Asn Gin Cys Ser 



780 



lie Ser Tyr Leu Met Asn Arg Met He Pro Leu Ala Val Lys Lys Leu 
5 790 795 aoo 

Lys Asp Phe Asp Asp Asn Leu Lys Arg Asp Leu Leu Glu Tyr He Asp 
805 810 815 

Thr Asn Glu Leu Tyr Leu Leu Asp Glu Val Asn He Leu Lys Ser Lys 
820 825 8 3o 

Val Asn Arg His Leu Lys Asp Ser He Pro Phe Asp Leu Ser Leu Tyr 
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40 



50 



GO 



65 



70 



835 840 845 

Thr Lys Asp Thr He Leu He Gin Val Phe Asn Asn Tyr He Ser Asn 
850 855 860 

He Ser Ser Asn Ala lie Leu Ser Leu Ser Tyr Arg Gly Gly Arg Leu 
865 8™ 875 880 

He Asp Ser Ser Gly Tyr Gly Ala Thr Met Asn Val Gly Ser Asp Val 
885 890 895 

He Phe Asn Asp He Gly Asn Gly Gin Phe Lys Leu Asn Asn Ser Glu 
900 905 910 

Asn Ser Asn He Thr Ala His Gin Ser Lys Phe Val Val Tvr Asp Ser 
915 920 925 ' 

Met Phe Asp Asn Phe Ser He Asn Phe Trp Val Arg Thr °ro Lvs Tvr 
930 935 94 b ^ 

Asn Asn Asn Asp He Gin Thr Tyr Leu Gin Asn Glu Tyr Thr He lie 
945 950 955 96Q 

Ser c:ys lie Lys Asn Asp Ser Gly Trp Lys Val Ser lie Lys Gly Asn 
965 970 975 

Aug He He Trp Thr Leu He Asp Val Asn Ala Lys Ser Lvs Ser He 
980 985 990 

Phe Phe Glu Tyr Ser lie Lys Asp Asn He Ser Asp Tyr lie Asn Lys 
995 1000 1005 

Trp Phe Ser He Thr He Thr Asn Asp Arg Leu Gly Asn Ala Asn He 
;01 ° 1015 1020 

Tyr lie Asn Gly Ser Leu Lys Lys Ser Glu Lys He Leu Asn Leu Asp 
1025 1030 1035 1040 

Arg He Asn Ser Ser Asn Asp He Asp Phe Lys Leu lie Asn Cys Thr 
!045 1050 1055 

Asp Thr Thr Lys Phe Val Trp He Lys Asp Phe Asn He Phe Gly Arq 
1060 1065 1070 

Glu Leu Asn Ala Thr Glu Val Ser Ser Leu Tvr Trp lie Gin Ser S-r 
1075 1080 io85 

Thr Asn Thr Leu Lys Asp Phe Trp Gly Asn Pro Leu Arg Tyr Asp Thr 
1090 1095 iioo 

Gin Tyr Tyr Leu Phe Asn Gin Gly Met Gin Asn lie Tyr He Lys Tvr 
1105 X11 0 1115 ii 20 

Phe Ser Lys Ala Ser Met Gly Glu Thr Ala Pro Arg Thr Asn Phe Asn 
1125 H30 1135 

Asn Ala Ala lie Ash Tyr Gin Asn Leu Tyr Leu Gly Leu Arg Phe II* 
1140 1145 1150 

lie Lys Lys Ala Ser Asn Ser Arg Asn lie Asn Asn Asp Asn 11^ Val 
1155 1160 H65 

Arg Glu Gly Asp Tyr lie Tyr Leu Asn lie Asp Asn lie Ser Asp Glu 
1170 1175 use 

Ser Tyr Arg Val Tyr Val Leu Val Asn Ser Lys Glu He Gin Thr Gin 
1185 1190 1195 1200 

Leu Phe Leu Ala Pro lie Asn Asp Asp Pro Thr Phe Tvr Asp Val Leu 
1205 1210 1215 
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10 



30 



40 



60 



70 



Gin He Lys Lys Tyr Tyr Glu Lys Thr Thr Tyr Asn Cys Gin lie Leu 
1220 1225 1230 

Cys Glu Lys Asp Thr Lys Thr Phe Gly Leu Phe Gly lie Gly Lys Phe 
1 235 1240 1245 

val Lys Asp Tyr Gly Tyr Val Trp Asp Thr Tyr Asp Asn Tyr Ph- Cys 
1250 1255 1 y * 



1260 



lie Ser Gin Trp Tyr Leu Arg Arg lie Ser Glu Asn He Asn Lys Leu 
- 65 1270 1275 12Q0 

Arg Leu Gly Cys Asn Trp Gin Phe lie Pro Val Asp Glu Gly Trp Thr 

1285 i29o 12 £ s 

Glu 

(2) INFORMATION FOR SEQ ID NO: 78; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1535 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

Hi) MOLECULE TYPE: DNA (genomic) 

■ ix) FEATURE : 

(A) NAME /KEY : CDS 
<B) LOCATION: 108 1526 

(:<i) SEQUENCE DESCRIPTION : SEQ ID NO: 78: 

3 _ AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 

TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT 



Met Gly His 

1 



CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT 
His His His His His His His His His Ser Ser Gly His lie G^ IVy 

10 15 * 



Val Gly ser Asp Val He Phe Asn Asp lie Gly Asn Gly Gin P ne Lys 

70 75 so 

7I A ^ T TCT GAA AAT AGT MT ATT ACG GCA CAT CAA AGT AAA TT- 

Leu Asn Asn Ser Glu Asn Ser Asn He Thr Ala His Gin Ser Lys Phe 

B5 90 95 



110 115 

AGG ACT CCT AAA TAT AAT AAT AAT GAT ATA CAA ACT TAT CTT CAA AAT 

Arg Thr Pro Lys Tyr Asn Asn Asn Asp lie Gin Thr Tyr Leu G^ fttn 

120 125 130 



60 
116 

164 



4S a™ StI o GC ATG GCT GAC ACA ATT TTA ATA CAA CTT TTT AAT ^ 

Aig His Met Ala Ser Met Ala Asp Thr lie Leu lie Gin Val Phe Asn 

25 30 35 

AAT TAT ATT AGT AAT ATT AGT AGT AAT GCT ATT TTA AGT TTA AGT TAT 
Asn T/r lie Ser Asn lie Ser Ser Asn Ala lie Leu Ser Leu Ser Tyr 
40 45 50 

AGA GGT GGG CGT TTA ATA GAT TCA TCT GGA TAT GGT GCA ACT ATG AAT 
Arg Gly Gly Arg Leu lie Asp Ser Ser Gly Tyr Gly Ala Thr Met Asn 
55 55 60 55 

Sit r?I If* GAT G , T 7 A T C TTT MT GAT A TA GGA AAT GGT CAA TTT AAA 



260 



308 



356 



4 04 



SIT Sit l AT GAT AGT ATG TTT GAT AAT TTT AGC ATT ^ G TTT TGG GTA 4S : 
lOQ ^ P los AEP AS " Phe ??? Ile Asn Phe Tr P Val 



500 
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■45 



65 



GAG TAT ACA ATA ATT AGT TGT ATA AAA AAT GAC TCA GGA TGG AAA GTA 
Glu Tyr Thr lie lie Ser Cys lie Lys Asn Asp Ser Gly Trp £JJ vll 

135 140 145 

I" A ™ ™ G GGA AAT AGA ATA ATA TGG ACA TTA ATA GAT GTT AAT GCA 
-ei lie Lys Gly Asn Arg lie lie Trp Thr Leu lie Asp Val Asn Ala 

AAA TCT AAA TCA ATA TTT TTC GAA TAT AGT ATA AAA GAT AAT ATA TCA 
Lys Ser Lys Ser He Phe Phe Glu Tyr Ser lie Lys Asp Asn He Ser 

170 175 

A^o Itl rT A ^ T *** TGG TTT TCC ATA ACT ATT A ^T AAT GAT AGA TTA 
Asp lyi lie Asn Lys Trp Phe Ser Ue Thr lie Thr Asn Asp Arg "u 



190 



195 



255 



I™ A T T ^ I CA TCT ACA ** r ACT TTA AAA GAT TTT TGG GGG AAT CCT 
•ip He Gin ser Ser Thr Asn Thr Leu Lys Asp Phe Trp Gly HI Pro 
265 270 275 

ll A AGA £ AC GAT ACA CAA TAC TAT CTG TTT AAT CAA GGT ATG CAA AAT 
Leu Arg Tyr Asp Thr Gin Tyr Tyr Leu Phe Asn Gin Gly £? G^ HI 

40 280 2 85 290 



I'e Jvr n A ^ T IT AGT *** GCT TCT ATG GGG GAA ACT GCA CCA 

He Tyi He Lys Tyr Phe Ser Lys Ala Ser Met Gly Glu Thr Ala Pro 

295 300 3 o5 



CGT ACA AAC TTT AAT AAT GCA GCA ATA AAT TAT CAA AAT TTA TAT CTT 
Arg Thr Asn Phe Asn Asn Ala Ala lie Asn Tyr Gin Hn llu Tvr III 

315 320 

GGT TTA CGA TTT ATT ATA AAA AAA GCA TCA AAT TCT CGG AAT ATA AAT 
Gly Leu Arg Phe lie Ue Lys Lys Ala Ser Asn Ser Arg Asn He HI 



335 



AAT GAT AAT ATA GTC AGA GAA GGA GAT TAT ATA TAT CTT AAT ATT GAT 
Asn Asp Asn lie Val Arg Glu Gly Asp Tyr lie Tyr ill HI ill ts P 



350 



355 



AAT ATT TCT GAT GAA TCT TAC AGA GTA TAT GTT TTG GTG AAT TCT AAA 
Asn He Ser Asp Glu Ser Tyr Arg Val Tyr Val Leu vll HI sir L^ 
60 360 3 65 3 70 

GAA ATT CAA ACT CAA TTA TTT TTA GCA CCC ATA AAT GAT GAT CCT ACG 
Glu lie Gin Thr Gin Leu Phe Leu Ala Pro He Asn Asp Asp Pro ?£? 
375 380 385 



Phe ?yr HI vll lIu G^ ft* *** ^ ™ T TAT GAA **A ACA ACA TAT 
Fne Tyr Asp Val Leu Gin lie Lys Lys Tyr Tyr Glu Lys Thr Thr Tyr 

395 400 



546 



596 



644 



692 



GGT AAC GCA AAT ATT TAT ATA AAT GGA AGT TTG AAA AAA AGT GAA AAA 
Gly Asn Ala Asn lie Tyr U e Asn Gly Ser Leu Lys £JJ ser Glu JCi 

20 '°° 205 210 

ATT TTA AAC TTA GAT AGA ATT AAT TCT AGT AAT GAT ATA GAC TTC AAA 
xle Leu Asn Leu Asp Arg He Asn Ser ser Asn Asp He Asp Phe 
- 15 220 225 

25 TTA ATT AAT TGT ACA GAT ACT ACT AAA TTT GTT TGG ATT AAG GAT TTT 

Leu He Asn Cys Thr Asp Thr Thr Lys Phe Val Trp He Lys Asp Phe 

235 240 

AAT ATT TTT GGT AGA GAA TTA AAT GCT ACA GAA GTA TCT TCA CTA TAT 
->» Asn lie Phe Gly Arg Glu Leu Asn Ala Thr Glu Val sir Ser Leu £yr 



740 



788 



836 



932 



980 



1028 



1076 



1172 



1220 



1268 



1316 



» i5 lyl SS iE S £ S £ « S - - % oj. ™ jjj 1364 
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405 410 415 

GGA ATT GGT AAA TTT GTT AAA GAT TAT GGA TAT GTT TGG GAT ACC TAT 1412 

Gly He Gly Lys Phe Val Lys Asp Tyr Gly Tyr Val Trp Asp Thr Tyr 

420 425 430 435 



GAT AAT TAT TTT TGC ATA AGT CAG TGG TAT CTC AGA AG A ATA TCT GAA 14 6 0 

Asp Asn Tyr Phe Cys He Ser Gin Trp Tyr Leu Arg Arg lie Ser Glu 
440 445 450 

AAT ATA AAT AAA TTA AGG TTG GGA TGT AAT TGG CAA TTC ATT CCC GTG 15 OB 

Asn He Asn Lys Leu Arg Leu Gly Cys Asn Trp Gin Phe He Pro Val 
455 460 465 

GAT GAA GGA TGG ACA GAA TAACTCGAG t c . , 

Asp Glu Gly Trp Thr Glu 
470 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 73 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:79: 

Met Gly His His His His His His His His His His Ser Ser Gly His 
1 5 io 



15 



He Glu Gly Arg His Met Ala Ser Met Ala Asp Thr lie Leu He Gin 
20 25 30 

Val Phe Asn Asn Tyr He Ser Asn He Ser Ser Asn Ala He Leu Ser 

40 45 

Leu Ser Tyr Arg Gly Gly Arg Leu lie Asp Ser Ser Gly Tyr Glv Ala 
50 55 60 

Thr Met Asn Val Gly Ser Asp Val lie Phe Asn Asp He Gly Asn Glv 
65 7 0 75 80 

Gin Phe Lys Leu Asn Asn Ser Glu Asn Ser Asn lie Thr Ala His Gin 
85 90 95 

Ser Lys Phe Val Val Tyr Asp Ser Met Phe Asp Asn Phe Ser He Asn 
100 105 no 

Phe Trp Val Arg Thr Pro Lys Tyr Asn Asn Asn Asp He Gin Thr Tyr 
115 120 125 

Leu Gin Asn Glu Tyr Thr lie lie Ser Cys He Lys Asn Asp Ser Gly 
130 135 140 

Trp Lys Val Ser He Lys Gly Asn Arg lie lie Trp Thr Leu lie Asp 
145 150 155 160 

Val Asn Ala Lys Ser Lys Ser He Phe Phe Glu Tyr Ser lie Lys Asp 
165 170 175 

Asn lie Ser Asp Tyr He Asn Lys Trp Phe Ser lie Thr lie Thr Asn 
180 185 190 

Asp Arg Leu Gly Asn Ala Asn lie Tyr lie Asn Gly Ser Leu Lys Lys 
19 5 200 205 

Ser Glu Lys He Leu Asn Leu Asp Arg He Asn Ser Ser Asn Asp He 
210 215 220 
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Asp Phe Lys Leu He Asn Cys Thr Asp Thr Thr Lys Phe Val Trp lie 
' 25 230 235 

Lys Asp Phe Aon lie Phe Gly Arg Glu Leu Asn Ala Thr Glu Val Ser 
245 250 255 

Ser Leu Tyr Trp lie Gin Ser Ser Thr Asn Thr Leu Lys Asp Phe Trp 
260 26S 270 

Gly Asn Pro Leu Arg Tyr Asp Thr Gin Tyr Tyr Leu Phe Asn Gin Gly 
275 280 285 

Met Gin Asn He Tyr lie Lys Tyr Phe Ser Lys Ala Ser Met Gly Glu 

295 300 

Thr Ala Pro Arg Thr Asn Phe Asn Asn Ala Ala He Asn Tyr Gin Asn 
JOi> 310 315 320 

Leu Tyr Leu Gly Leu Arg Phe He lie Lys Lys Ala Ser Asn Ser Arg 
325 330 335 

Asn lie Asn Asn Asp Asn lie Val Arg Glu Gly Asp Tyr He Tyr Leu 
340 345 350 

Asn He Asp Asn lie Ser Asp Glu Ser Tyr Arg Val Tyr Val Leu Val 
355 3 6 o 36S 

Asn Ser Lys Glu He Gin Thr Gin Leu Phe Leu Ala Pro He Asn Asp 
i u 375 380 

Asp Pro Thr Phe Tyr Asp Val Leu Gin He Lys Lys Tyr Tyr Glu Lys 

Thr Thr Tyr Asn Cys Gin He Leu Cys Glu Lys Asp Thr Lys Thr Phe 
405 410 415 

Gly Leu Phe Gly He Gly Lys Phe Val Lys Asp Tyr Gly Tyr Va' Trp 
420 425 430 V 

Asp Thr Tyr Asp Asn Tyr Phe Cys He Ser Gin Trp Tyr Leu Arg Arg 

^<l-> 44 0 445 

lie Ser Glu Asn lie Asn Lys Leu Arg Leu Gly Cys Asn Trp Gin Phe 
SbU 455 460 

lie Pro Val Asp Glu Glv Tro Thr Glu 
465 476 

(2) INFORMATION FOR SEQ ID NO: 80: 

(ii SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 3 0 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = " DNA " 

(Xi> SEQUENCE DESCRIPTION ; SEQ ID NO: 80: 

CGCCATGGCT GACACAATTT TAATACAAGT 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION ; SEQ ID NO: 81: 

GCCTCGAGTT ATTCTGTCCA TCCTTCATCC AC 

(2) INFORMATION FOR SEQ ID NO: 82: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

{ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME / KEY : Modi f ied-si te 
<B) LOCATION: 12 

(D) OTHER INFORMATION: /note= "The asoaracHno 
th ls position contains an amide group." as P ar ^^e residue at 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:82: 

Cys Gin Thr lie Asp Gly Lys Lys Tyr Tyr Phe Asn 
5 10 
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CLAIMS 

I • A host cell containing a recombinant expression vector, said vector encoding a 
protein comprising at least a portion of a Clostridium botulinum toxin, said toxin selected 
from the group consisting of type B toxin and type E toxin. 

2. The host cell of Claim I . wherein and said host cell is capable of expressing 
said protein at a level greater than or equal to 5% of the total cellular protein. 

3. The host cell of Claim 1 . wherein and said host cell is capable of expressing 
said protein as a soluble protein at a level greater than or equal to 0.25% of the total soluble 
cellular protein. 

4. The host cell of Claim 1. wherein said host cell is an Escherichia coli cell, 
a. The host cell of Claim I. wherein said host cell is an insect cell. 

6 The host cell of Claim I. wherein said host cell is a yeast cell. 

7. A host cell containing a recombinant expression vector, said vector encoding a 
fusion protein comprising a non-toxin protein sequence and at least a portion of a Clostridium 
hmulimm, toxin, said toxin selected from the group consisting of type B toxin and type I: 
toxin. 



8. The host cell of Claim 7. wherein said portion of said toxin comprises the 
receptor binding domain. 

9. The host cell of Claim 7. wherein said non-toxin protein sequence comprises a 
poly-histidine tract. 

10. A vaccine comprising a fusion protein, said fusion protein comprising a non- 
toxin protein sequence and at least a portion of a Clostridium botulinum toxin, said toxin 
selected from the group consisting of type B toxin and type E toxin. 
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11. The vaccine of Claim 1 0 further comprising a fusion protein comprising a non- 
loxin protein sequence and at least a portion of Clostridium botulinum type A toxin. 

1 2. The vaccine of Claim 1 0. wherein said portion of said Clostridium botulinum 
toxin comprises the receptor binding domain. 

1 3. The vaccine of Claim 1 0 wherein said non-toxin protein sequence comprises a 
poly-histidine tract. 

14. The vaccine of Claim 1 0. wherein said vaccine is substantially endotoxin-free. 

1 5. A method of generating antibody directed against a Clostridium botulinum 
toxin comprising: 

a) providing in any order: 

0 an antigen comprising a fusion protein comprising a non-toxin 
protein sequence and at least a portion of a Clostridium botulinum toxin, said 
toxin selected from the group consisting of type 13 toxin and type E toxin, and 

ii) a host; and 

b) immunizing said host with said antigen so as to generate an antibody. 

16. The method of Claim 1 5, wherein said antigen further comprises a fusion 
protein comprising a non-toxin protein sequence and at least a portion of Clostridium 
botulinum type A toxin. 

1 7. The method of Claim 1 5. wherein said portion of said Clostridium botulinum 
toxin comprises the receptor binding domain. 

1 8. The method of Claim 1 5 wherein said non-toxin protein sequence comprises a 
poly-histidine tract. 

19. The method of Claim 15 wherein said host is a mammal. 

20. The method of Claim 19 wherein said mammal is a human. 
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21. The method of Claim 15 further comprising step c) collecting said antibodies 
from said host. 



22. The method of Claim 21 further comprising step d) purifying said antibodies. 

5 

23. The antibody raised according to the method of Claim 15. 

24. The antibody raised according to the method of Claim 16. 
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FIGURE 1 
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FIGURE 3 
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FIGURE 4 
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FIGURE 5 
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FIGURE 6 




Fragment 



Fragment 3 



EcoRl 
t 



PstI 



P3 



Fragment 2 



P4 



P1-P4 *rePCR primers 1-4. P1=5 , GGAAATTTAGCTGCAGCATCTGAC3 , ( 
PZ=5TCTAGCAMTTCGCrTGTGTTGAA3\P3=5 , CTCGC^ 

P4=S , C^ATCTAGGCCTAAAGTAT3^ Indicated restriction sites in fragments 1 and 2 are internal 
sites used to done into pGEX2T vector (fragment 1 ; construct called pGA3O-660) or pMALc 
vector (fragment 2; construct called pMA660-1 100). Bracketed restriction sites at ends of 
fragment 3 are pUC9 polytinker sites utilized to clone fragment 3 into pET23 vector 
(construct called pPAl 1 00-2680). Numbers in these constructs refer to toxin A amino acid 
interval that is expressed The shaded portion of the toxin A gene corresponds to the repeating 
ligand binding domain. 
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FIGURE 8 



pPA660-2366 
pPA1 100-2366 
pPA30-2366 



pPA30-2190 C 
pMA3O-1870 C 
pMA66O-1870 



Toxin A f 



Hinctl EwRI Pstl Spol EooRI p^t! 



pP relors to pET23 vector. pM refers to pMALc vector. A refers to toxin A, and numbers refer to 
amino acid intorvai expressed in done. Endpoints of clones correspond to Indicated restriction sites 
shown of toxin A map. 
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FIGURE 9 
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Toxin A I ■ 


Clal 

1 




Hindi Pstl 


EcoRI Pstl Stul Spcl Pstl 


Pstl 


PMA30-270 f • 1 








pMAiirwifim I i 




PMA30O-660 1— 


1 pMAlhKM«/0 1 1 




pMA66O-1100 1 1 pMAI 870-2680 ■■■■MHB 






pMAi4<;ruiR7n 1 ■ 






DPA11 00-1450 1 1 






dPA1 100-1 870 1 ■ 










pP refers to pET23 vector, pM refers to pMALc vector, A refers to toxin A, and numbers refer 
to amino acid interval expressed in clone. Endpoints of clones correspond to indicated 
restriction sites shown of toxin A map. 
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FIGURE 1 1 
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FIGURE 13 
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FIGURE 14 
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FIGURE 15 
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FIGURE 16 
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Summary of expression constructs from the toxin B gene 
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FIGURE 19 
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FIGURE 20 



Expression constructs from the interval 3 region 
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